DECchip21071 and DECchip21072 
Core Logic Chipsets 

Data Sheet 

Order Number: EC-QAEMB-TE 



Revision/Update Information: This document supersedes the 

DECchip 21071 and DECchip 21072 
Core L ogic Chips^s Data Sheet, 
(EC-QAEMA-TE). 



Digital Equipment Corporation 
Maynard, Massachusetts 



J anuary 1996 

While Digital believes the information included in this publication is correct as of the date of 
publication, it is subject to change without notice. 

Digital Equipment Corporation makes no representations that the use of its products in the 
manner described in this publication will not infringe on existing or future patent rights, nor do 
the descriptions contained in this publication imply the granting of licenses to mal<e, use, or sell 
equipment or software in accordance with the description. 

©Digital Equipment Corporation 1993, 1994, 1996. All Rights Reserved. AlphaGeneration, 
Digital, Digital Semiconductor, OpenVMS, VAX, VAX DOCUMENT, the AlphaGeneration design 
marl<, and the DIGITAL logo are trademarl<s of Digital Equipment Corporation. 

Digital Semiconductor is a Digital Equipment Corporation business. 

Intel is a trademark of Intel Corporation. 
RamDAC is a trademark of Brooktree Corporation. 

All other trademarks and registered trademarks are the property of their respective owners. 



This document was prepared using VAX DOCUMENT Version 2.1. 



DECchip 21071 and DECchip 21072 
Core Logic Chipsets 
Data Sheet 



21071 and 21072 Features: 

• Supports the entire family of DECchip 
21064 Alpha AXP microprocessors 

• DECchip 21 071 : 1 28-bit cache/64-bit 
memory 

• DECchip 21 072: 1 28-bit cache/1 28-bit 
memory 

• System clock frequency up to 33 MHz 

• Bcache/memory controller 

- Write-back cache 

- Bcache size from 128 KB to 16 MB 

- Bcache SRAMs, 17 ns and faster 

- 32-bit parity/32-bit ECC on Bcache 
(DECchip 21072 only) 

- 8 MB to 4 GB of memory supported 



- 267 MB/s CPU write bandwidth, 
107 MB/s CPU read bandwidth 

- 32-bit parity/32-bit ECC on memory 
data (DECchip 21072 only) 

- RAS/CAS memory bus to industry- 
standard SIMMs 

- DRAM controller with fully 
programmable timing with 15 ns 
granularity 

High-performance PCI bridge 
120 MB/s DMA write bandwidth, 70 
MB/s DMA read bandwidth, 82 MB/s 
programmed I/O write bandwidth, 22 
MB/s programmed I/O read bandwidth 

Graphics support 



The DECchip 21071 and DECchip 21072 core logic chipsets provide a cost-effective solution 
for designing uniprocessor systems using the DECchip 21064 family of Alpha AXP micro- 
processors. The chipsets include a secondary cache and memory controller, PCI interface, 
and corresponding data path functions. The chipsets provide ample flexibility to the system 
designer in building the memory and I/O subsystem, and they require minimal discrete logic 
on the module. The 21071 and 21072 chipsets contain three unique gate arrays: 



• 21071-CA (Cache/memory controller) - 208 PQFP 

• 21071-DA (PCI interface) - 208 PQFP 



• 21 071 -BA (Data path) - 208 PQFP 
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Preface 



Purpose and Audience 

This document is a support and reference document for engineers wino design 
uniprocessor systems using an Alplna 21064 microprocessor. 

Organization 

Tliis document is divided into tiie following parts: 

An overview of the DECchip 21071 and DECchip 21072 core logic chipsets 
precedes Part I. 

Part I contains information about the DECchip 21071-CA chip. 

Part II contains information about the DECchip 21071-DA chip. 

Part III contains information about the DECchip 21071-BA chip. 

Appendix A contains PAL programming equations. 

This document contains the following chapters: 

Chapter 1 provides a brief overview of the DECchip 21071 and DECchip 
21072 core logic chipset features. 

Chapter 2 describes the DECchip 21071-CA pin signals. 

Chapter 3 describes the DECchip 21071-CA architecture. 

Chapter 4 describes the DECchip 21071-CA control and status registers. 

Chapter 5 describes the transactions supported by the DECchip 21071-CA 
chip on the sysBus and memory interface. 

Chapter 6 describes the DECchip 21071-CA electrical requirements. 

Chapter 7 describes the behavior of the DECchip 21071-CA chip during 
power-up. 

Chapter 8 describes the DECchip 21071-DA pin signals. 



Chapter 9 describes the DECchip 21071-DA architecture. 

Chapter 10 describes the the DECchip 21071-DA control and status 
registers. 

Chapter 11 describes the transaction flows supported by the DECchip 
21071-DA chip. 

Chapter 12 describes the DECchip 21071-DA electrical requirements. 

Chapter 13 describes the behavior of the DECchip 21071-DA chip during 
power-up. 

Chapter 14 describes the DECchip 21071-BA pin signals. 

Chapter 15 describes the DECchip 21071-BA architecture. 

Chapter 16 describes the flow of data within the DECchip 21071-BA chip 
for various transactions on thesysBus, memory data bus, and PCI bus. 

Chapter 17 describes the DECchip 21071-BA electrical requirements. 

Chapter 18 describes the behavior of the DECchip 21071-BA chip during 
power-up. 

Conventions Used in this Document 

The following conventions are used in this document: 



Convention 



Meaning 



Note 

Caution 

Warning 

Numbering 

Ranges 
Extents 



Provides general information that could be useful. 

Provides information to prevent damage to equipment. 

Provides information to prevent personal injury. 

All numbers are decimal unless otherwise indicated. Numbers 
other than decimal are indicated with the name of the base 
following the number in parentheses. For example: FF (hex) 

Ranges are specified by a pair of numbers separated by two 
periods (..), and are inclusive. For example, a range of integers 
0..4 includes the integers 0, 1, 2, 3 and 4. 

Extents are specified by a pair of numbers in angle brackets 
separated by a colon (:), and are inclusive. For example, bits 
<7:3> specify an extent of bits including bits 7, 6, 5, 4, and 3. 



Convention 



Meaning 



Clock edges References to the rising and falling edges of clocks as defined by 

specifying the clock name followed by an R or F. For example, the 
rising edge of clkl is referred to as clklR and the falling edge of 
memClk is referred to as memClkF. 

Signal edges References to the assertion and deassertion of signals are defined 

by using the (a) and (v) characters to indicate signal rising and 
falling edges. For example, the deassertion of memRASJ is 
referred to as memRASjA. 

sysBus Refers to the DECchip 21064 pin bus (data, address, and 

controls) and the control signals between and the DECchip 
21071-BA, DECchip 21071-CA, and the DECchip 21071-DA (or 
any other 1/0 bridge). 

memClk cycle Defined as the time from a memClk rising edge up to the next 

memClk rising edge. If a signal transitions in either the rising 
or falling edge of cycle N, then the signal is defined as occurring 
in cycle N. 



GCR 


Refers to general control register 


Bcache 


Refers to backup cache (secondary cache). 


TLB 


Refers to Translation lookaside buffer. 


Byte 


Contains 8 bits. 


Word 


Contains 16 bits. 


Longword 


Contains 32 bits. 


Quadword 


Contains 64 bits. 


Octaword 


Contains 128 bits. 


H exaword 


Contains 256 bits (the size of one cache line) 



1 

DECchip 21071 and DECchip 21072 Core 

Logic Chipset Overview 



1.1 DECchip 21071 and DECchip 21072 Core Logic Chipset 
Features 

The DECchip 21071 and DECchip 21072 core logic chipsets provide a cost- 
competitive solution for designing uniprocessor systems that use the Alpha 
21064 microprocessor. The DECchip 21071 chipset provides a 64-bit memory 
interface; the DECchip 21072 provides a 128-bit memory interface. 

The chipsets include a Bcache (secondary cache) and memory controller, 
PCI interface, and corresponding data path functions. They provide ample 
flexibility to the system designer in building the memory and I/O subsystem 
and require minimal discrete logic on the module. The DECchip 21071 and 
DECchip 21072 chipsets contain three unique gate arrays: 

• DECchip 21071-CA (cache/memory controller) - 208 PQFP 

• DECchip 21071-DA (PCI interface) - 208 PQFP 

• DECchip 21071-BA (data path) - 208 PQFP 

The following list summarizes the major features of the DECchip 21071 and 
the DECchip 21072 chipsets: 

• Supports the Alpha 21064 microprocessor 

• DECchip 21071 chipset: 

- Supports 128-bit cache/64-bit memory 

- Contains two DECchip 21071-BA chips 

- Contains one DECchip 21071-CA chip 

- Contains one DECchip 21071-DA chip 
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DECchip 21072 chipset: 

- Supports 128-bit cache/128-bit memory 

- Contains four DECcinip 21071-BA chips 

- Contains one DECchip 21071-CA chip 

- Contains one DECchip 21071-DA chip 
System clock frequency up to 33 IM Hz 
Bcache (secondary cacheVmemory controller: 

- Write-back cache 

- Bcache size from 128 KB to 16 MB 

- Bcache SRAMs, 17 ns and faster 

- 32-bit parity/32-bit ECC on Bcache 

- 8MB to 4GB of memory supported 

- 267 M B/s CPU write bandwidth, 107 M B/s CPU read bandwidth 

- 32-bit parity/32-bit ECC on memory data (DECchip 21072 chipset only) 

- RAS/CAS memory bus to industry-standard SIMMs 

- DRAM controller with fully programmable timing with 15 ns 
granularity 

- Optional cache allocates for CPU writes 
High-performance PCI bridge: 

- 32-bit multiplexed address/data 

- I ndustry standard 

- No glue logic needed to connect PCI -compliant chips 

- 120 MB/s DMA write bandwidth, 70 MB/s DMA read bandwidth, 82 
MB/s programmed I/O write bandwidth, 22 MB/s programmed I/O read 
bandwidth 

- Scatter/gather map support 
Graphics support: 

- H igh bandwidth memory data path to video RAM (VRAM ) 

- Provides support for direct connection to VRAM frame buffer 
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1.2 System Overview 

Figure 1-1 shows a block diagram of a system tiiat is built using the DECchip 
21071 and DECchip 21072 chipsets. 

Figure 1-1 DECchip 21071 and DECcliip 21072 System Block Diagram 
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The system is built using the following components: 
DECchip 21064 microprocessor 
DECchip 21071-BA chip 
DECchip 21071-CA chip 
DECchip 21071-DA chip 
Bcachedata and tag RAMs 
Bcache control PALs 
Cache address buffers 
System clock generator 
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Serial ROM interface 

Interrupt control/CPU configuration PALs 

Memory SIMMs 

PCI interrupt controller 

PCI peripherals 

PCI arbiter 

System ROM 

1.2.1 Alpha 21064 Microprocessor 

The DECchip 21071 and DECchip 21072 chipsets support the Alpha 21064 
microprocessor. The microprocessor can run at cycle times which range 
between 3.3 ns and 10 ns. 

The Alpha 21064 microprocessor contains two on-chip 8 KB direct-mapped 
caches, one for use as an instruction cache (I cache), the other for use as a data 
cache (Dcache). For details about the Alpha 21064 microprocessor, refer to the 
Alpha 21064 and Alpha 21064A Microprocessors H ardware Reference M anual . 

1 .2.2 Bcache Data and Tag RAMs 

The DECchip 21071 chipset supports an optional write-back Bcache (secondary 
cache). System performance improves if the optional write-back Bcache is 
included. The size of the Bcache can range from 128 KB to 16 MB and the 
cache-line size is fixed at 32 bytes. The Bcache RAM data width is 128 bits. 
The speed of the Bcache RAMs generally ranges from 10 ns to 17 ns depending 
on cost and performance requirements, module routing delays of the targeted 
system, and the system clock cycle time. 

The only restriction that the DECchip 21071 and DECchip 21072 chipsets 
place on the speed of the Bcache is that a read from the cache RAMs must be 
completed in one system clock cycle. 

1.2.3 Bcache Control PALs 

Systems that use the 21071-CA (cache/memory controller) chip and Bcache 
(secondary cache) need to implement two Bcache control PALs; these control 
PALs provide the tag and data RAMs with output enables, write enables, and 
lower address bits. 
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The Bcache control PALs are used to: 

• Implement the NOR function between the processor-generated cache 
control signals and the system cache control signals. 

• Generate timing of system cache control signals, so that the cache access 
loop generated by the 21071-CA chip can be better controlled. 

• Generate some of the control signals for the processor data bus. 

1 .2.4 Cache Address Buffer 

The cache address buffer is required to distribute the cache address to all the 
data and tag cache RAMs. 

1.2.5 DECchip 21071-BA Features 

The DECchip 21071-BA chip provides a 32-bit data path from the Alpha 21064 
microprocessor to main memory and I/O. Depending on the selected width 
of the memory interface, two DECchip 21071-BA chips are required for a 
64-bit interface, and four DECchip 21071-BA chips are required for a 128-bit 
i nterface. 

The DECchip 21071-BA chip contains the cache and memory interface data 
path, which includes buffers for victim, noncacheable write, and DMA write 
operations. It also contains the I/O subsystem data path which provides 
buffering for DMA read and write data, and I/O read and write data. 

The DECchip 21071-BA chip interfaces with the cache and CPU using the CPU 
sysBus (pin bus). It interfaces with the 21071-DA through the 32-bit epIBus. 

The DECchip 21071-BA chip functions as the data path for the cache, memory, 
and I/O subsystem, and it contains the following data path functions: 

Error Correction/Detection Logic: The DECchip 21071-BA chip supports 
longword (32 bits) parity in 64-bit and 128-bit memory mode. ECC mode may 
be used with 128-bit wide memory by using some of the unused higher order 
CPU data bits as checl< bits. Error checking/generation is done only on DMA- 
initiated transactions; error checking/generation on CPU-initiated transactions 
is performed by the CPU. 

Memory VUk'ite Buffer: The memory write buffer has four entries; each entry 
is a cache line (32 bytes). This buffer is spread across the DECchip 21071-BA 
chips (two or four chips) in the system. Data stored in this buffer has been 
through all the cache coherency checks and is written to memory in the order 
it was received on the sysBus. 
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Memory Read Buffer: The memory read buffer is a one-cache-line (32 bytes), 
temporary holding buffer used to store data read from memory by the CPU or 
DMA requests. 

Merge and I/O Read Buffer: The merge and I/O read buffer is a one-cache- 
line (32 bytes), temporary holding buffer used to store data written by the CPU 
on memory writes or to store data read from the PCI bus on CPU reads from 
I/O space. 

I/O V\frite Buffer: The I/O write buffer has two entries - one entry acts as a 
write buffer for CPU I/O writes to the DECchip 21071-BA chip or PCI bus; the 
other acts as a holding buffer. 

DMA Read Buffer: The DMA read buffer stores data that is being read from 
the memory by a device on the PCI bus. This buffer is two cache lines deep 
and is spread across the DECchip 21071-BA chips in the system. 

DMA VUk'ite Buffer: The DMA write buffer stores four cache lines of PCI 
memory write data. Each entry is transferred to the memory write buffer after 
the necessary cache coherency checks have been performed. 

1.2.6 DECchip 21 071 -CA Features 

The DECchip 21071-CA chip provides the interface from the Alpha 21064 
microprocessor to cache and main memory and includes the cache and memory 
controller. The DECchip 21017-CA chip controls and moves data to and from 
banks of main memory. 

The DECchip 21071-CA responds to commands from the CPU and DECchip 
21071-DA chip and arbitrates between them. It also provides support for 
control of the Bcache RAMs during CPU cache miss and DMA transactions. 

The DECchip 21071-CA chip (cache/memory controller) can directly control up 
to 16 banks of DRAM memory. Each bank may be composed of either DRAM 
parts or SIMMs. 

Each DRAM may have IM, 4M, or 16M addressable locations (IM x 1, IM x 4, 
4M X 1, 4M X 4, and 16M x 1 DRAM sizes are supported). Each location 
consists of either a quadword or octaword of data, for 64-bit and 128-bit data 
width, respectively. Maximum DRAM memory is 4 GB and minimum DRAM 
memory is 8 MB. 

The DECchip 21071-CA chip provides support for a single video bankset of 
dual-port RAM (VRAM). This bankset can have 128K, 256K, or 512K locations. 
Each location consists of quadword data for a 64-bit interface, or octaword data 
for a 128-bit interface. VRAM capacity can vary from 1 MB to 8 MB. 
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The components of the cache and memory subsystem are distributed between 
the 21071-CA and 21071-BA. Together, the chips serve as an interface between 
the sysBus and memory subsystem (Figure 1-1). 

The CPU, 21071-DA, cache, and memory communicate with each other through 
the sysBus. The sysBus is essentially the processor pinbus with additional 
signals for DMA transaction control, arbitration, and cache control. The 
DECchip 21071-CA chip performs the Bcache and memory control functions. 

The following list summarizes the major features of the 21071-CA chip: 

• Provides control for filling the Bcache and extracting victims on CPU - 
initiated transactions. 

• Provides control for probing the Bcache on DMA transactions and 
invalidating the Bcache on DMA write hits. 

• Provides arbitration between the CPU and the DECchip 21071-DA chip for 
control of the sysBus. 

• Stores addresses for the four-cache-line memory write buffer. 

• Controls the loading of the I/O write buffer and the DMA read buffer. 

• Uses fast-page mode on the DRAMs to improve performance on DMA burst 
reads and memory writes. 

• Supports a frame buffer on the memory data bus. 

1.2.7 DECchip 21071-DA Features 

The DECchip 21071-DA chip functions as the bridge between the PCI and the 
CPU, its Bcache, and memory (Figure 1-1). The DECchip 21071-DA interface 
protocol is compliant with the PCI local bus. With the exception of a few 
pipeline registers and the parity tree, all the data path functions required to 
support the PCI reside in the DECchip 21071-BA chip. 

The DECchip 21071-DA chip provides all controls and interfaces to the PCI 
and sysBus and contains the following components and functions: 

sysBus interface state machine 

sysBus address decoder and translator 

epIBus arbitration and control 

PCI interface, state machines, and parity generation 

PCI address decoder and translator 
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The following list describes the major features of the DECchip 21071-DA chip: 

• Scatter/gather mapping from the 32-bit PCI address to the 34-bit physical 
address, with on-chip, 8-entry translation lookaside buffer (TLB) for fast 
address translations. To reduce cost, the scatter/gather tables are stored in 
memory and are automatically read by the DECchip 21071-DA chip (PCI 
bridge) when a translation misses in the TLB. 

• Supports a maximum PCI burst length of 16 longwords on PCI memory 
reads and writes. 

• Supports two types of addressing regions on CPU-initiated transactions to 
PCI space. 

- Sparse space for accesses with byte and word granularities, and a 
maximum burst length of 2. 

- Dense space for burst lengths from 1 to 8 longwords on writes and 
a burst length of 2 longwords on reads. This region can be used for 
memory-like structures such as frame buffers, which require high 
bandwidth accesses. 

• Stores address information for the DMA write buffer, and controls the 
loading of the DMA write buffer and I/O read buffer. 

• Stores address information for the I/O write buffer and controls the 
unloading of the I/O write buffer and DMA read buffer. 

Peripheral chips can be connected to the DECchip 21071-DA chip without any 
glue logic; however, logic that is external to the DECchip 21071-DA chip is 
required for interrupt arbitration, interrupt vector generation, DMA request 
generation, and interval timer implementation. 

Note 



The DECchip 21071-DA chip is not a PCI peripheral; it is a bridge 
between the PCI peripherals and the CPU/system memory. The 
DECchip 21071-DA chip implements the functions of a host bridge that 
are not sufficient to interface the DECchip 21071-DA chip as a PCI 
peripheral component. 
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1 .2.8 System Clock Generator 

Systems that use the DECchip 21071 or DECchip 21072 chipsets are targeted 
to run at system cycle times that range from 30 ns to 40 ns. The system clock 
generator must provide clklx2 and clk2ref to each chip in the chipset. 

Other system-specific clocks, for example, the PCI clock, must also be generated 
by the system clock generator. The system clock generator generates these 
clocks from the sysClkOutl_h and sysClkOut2_h signals, which are supplied 
by the Alpha 21064 microprocessor. 

1.2.9 Serial ROM 

The Alpha 21064 microprocessor provides an interface to a serial ROM, which 
can be used to initialize the instruction cache (I cache). The details for the 
implementation of this function can be found in the DECchip 21064 and 
DECchip 21064A Alpha AXP Microprocessors H ardware Reference M anual . 

1.2.10 Interrupt Control/CPU Configuration PAL 

The interrupt control/CPU configuration PAL provides system configuration 
information to the processor and six hardware interrupts. The PAL outputs 
connect to signals irq_h<5:0>from the Alpha 21064 microprocessor. 

When resetj is asserted, the PAL provides system clock configuration 
information and data bus width information to the processor on irq_h<5:0>. 

When resetj is deasserted, the PAL reflects the value of the system hardware 
interrupts to the processor on irq_h<5:0>. 

1.2.11 Memory SIMMs 

The DECchip 21071-CA chip (cach^memory controller) can directly control up 
to 16 banks of DRAM memory. Each bank may be composed of either DRAM 
parts or SIMMs. 

Each DRAM may have IM, 4M, or 16M addressable locations (IM x 1, IM x 4, 
4M X 1, 4M X 4, and 16M x 1 DRAM sizes are supported). Each location 
consists of either a quadword or octaword of data, for 64-bit and 128-bit data 
width, respectively. Maximum DRAM memory is 4 GB and minimum DRAM 
memory is 8 MB. 
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The DECchip 21071-CA chip provides support for a single video bankset of 
dual-port RAM (VRAM). This bankset can have 128K, 256K, or 512K locations. 
Each location consists of either a quadword or octaword of data, for 64-bit 
or 128-bit data width, respectively. Maximum VRAM memory is 8 MB and 
minimum VRAM memory is 1 MB. 

1.2.12 PCI Interrupt Controller 

An external interrupt controller is required to handle the interrupts posted by 
the PCI (and expansion bus) peripherals. 

1.2.13 PCI Peripherals 

The DECchip 21071 and DECchip 21072 chipsets, specifically the 21071-DA 
chip, can operate with any PCI -compliant, 32-bit peripheral. 

1.2.14 PCI Arbiter 

An external arbiter is required to determine ownership of the PCI bus during 
system operations. 

1.2.15 System ROM 

The system ROM contains all the console code and firmware that the system 
requires. The system ROM should be accessible to the DECchip 21071 and 
DECchip 21072 chipsets through the PCI bus. 
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Parti 



Part I contains six chapters that provide information about the DECchip 
21071-CA chip. The following table provides a brief description of each 
chapter: 



Chapter 



Description 



2 
3 
4 
5 

6 

7 



Describes the DECchip 21071-CA pin signals. 

Describes the DECchip 21071-CA architecture. 

Describes the DECchip 21071-CA control and status registers. 

Describes the transactions supported by the DECchip 21071-CA chip on 
the sysBus and memory interface. 

Describes the DECchip 21071-CA electrical requirements. 

Describes the behavior of the DECchip 21071-CA chip during power-up. 



2.1 



2 

DECchip 21071-CA Pin Descriptions 

This chapter provides a description of the DECchip 21071-CA pin signals. 

DECchip 21071-CA Pin List 

Table 2-1 lists the pin signals grouped by function. The information in the 
Type column identifies a signal as input (I), output (O), or bidirectional (B). 
The Buffer Strength column indicates the buffer drive strength. 

All output and bidirectional pins, except pTestout, can betristated. 



Table 2-1 DECchip 


21071-CA Pin List 












Buffer 




Signal Name 


Quantity 


Type 


Strength 


Function 


CPU/Bcache Signals 










(85 Total) 










sysData<15:0> 


16 


B 


4 ma 


Data pins for CSR data 


sysAdr<33:5> 


29 


1 


- 


Address bus 


tagAdr<31:17> 


15 


B 


4 ma 


Bcache tag 


tagAcirP 


1 


B 


4 ma 


Bcache tag parity 


tagCtIV 


1 


B 


4 ma 


Bcache valid bit 


tagCtID 


1 


B 


4 ma 


Bcache dirty bit 


tagCtIP 


1 


B 


4 ma 


Bcache control parity bit 


cpuCWMasl«7:0> 


8 


1 


- 


Cycle write mask 


cpuCReq<2:0> 


3 


1 


- 


Cycle request 


cpuCAck<2:0> 


3 





4 ma 


Command acknowledge 


cpuDRAck<2:0> 


3 





4 ma 


Data read acknowledge 


cpuDWSel<l> 


1 





4 ma 


Data word select 


cpuDlnvReq 


1 





4 ma 


Dcache invalidate 
request 


cpuHolcJReq 


1 





4 ma 


Hold request 



(continued on next page) 
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Table 2-1 (Cont.) DECchip 21071-CA Pin List 



Signal Name 


Quantity 


Type 


Buffer 
Strength 


Function 


CPU/Bcache Signals 
(85 Total) 


cpuHoldAck 


1 


1 


- 


Hold acknowledge 


Bcache/PAL Control Signals 
(9 Total) 


sysEarlyOEEn 

sysTagOEEn 

sysDataOEEn 









8 ma 
8 ma 
8 ma 


Early output enable 
Bcachetag output enable 
Bcache data output 
enable 


sysDataALEn 







8 ma 


Bcache address bit 
enable low phase 


sysDataAHEn 

sysTagWE 
sysDataWEEn 










n3 03 03 

E E £ 

00 00 00 


Bcache address bit 
enable high phase 
Bcache tag write enable 
Bcache data short-write 
WE enable 


sysDataLongWE 







8 ma 


Bcache data long-write 
write enable 


sysDOE 







8 ma 


PAL CPU data output 
enable 


PCI Bridge Interface Signals 
(9 Total) 



ioRequest<l:0> 


2 


1 


- 


21071-DA sysBus cycle 
request 


ioGrant 


1 





8 ma 


21071-DA sysBus cycle 
grant 


ioCmd<2:0> 


3 


1 


- 


21071-DA command 
request 


ioCAck<l:0> 


2 





8 ma 


21071-DA command 
acknowledge 


ioDataRdy 


1 





8 ma 


21071-DA DMA read 
data ready 



(continued on next page) 
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Table 2-1 (Cont.) DECchip 21071-CA Pin List 



Signal Name 



Quantity Type 



Buffer 
Strength 



Function 



Data Path Control Signals 
(16 Total) 



drvSysData 







8 ma 


drvSysCSR 







4 ma 


drvM emData 







8 ma 


syslORead 







8 ma 


sysReadOW 







8 ma 


subCmdA<l:0> 


2 





4 ma 


subCmdB<l:0> 


2 





4 ma 


subCmdCommon 


1 





8 ma 


sysCmd<2:0> 


3 





8 ma 


manCmd<3:l> 


3 





8 ma 



Turns on the 21071-BA 
sysData<127:16> drivers 
Turns off tlie 21071-BA 
sysData<15:0> drivers 
Turns on tlie 21071-BA 
memData drivers 
Selects I/O read buffer to 
sysBus 

Selects octaword to be 
returned on sysBus 
Sub-commands for 
sysBus 

Sub-commands for 
sysBus 

Sub-command for sysBus 
Commands for sysBus 
side of 21071-BA chip 
Commands for memory 
side of the 21071-BA 
chip 

(continued on next page) 
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Table 2-1 (Cont.) DECchip 21071-CA Pin List 



Signal Name 


Quantity 


Type 


Buffer 
Strength 


Function 


Memory Signals 
(39 Total) 


memAdr<ll:0> 


12 





8 ma 


M emory address 


memRAS J <8:0> 


9 





8 ma 


M emory row address 
strobe 


memRASBJ<8:0> 


9 





8 ma 


Memory second subset 
RAS 


memCAS J <3:0> 


4 





8 ma 


Memory column address 
strobe 


memWE I<1:0> 


2 





8 ma 


M emory write enable 


memPDCIk 


1 





4 ma 


M emory presence detect 
clock 


memPDLoadJ 


1 





4 ma 


M emory presence detect 
load enable 


memPDDIn 


1 


1 


4 ma 


M emory presence detect 
data in 


Video Support Signals 
(4 Total) 



vFrameJ 


1 


1 


vRefreshJ 


1 


1 


memDTOE_l 


1 





memos F 


1 






8 ma 



8 ma 



Video request for full 
serial register load 
Video request for split 
serial register load 
Dual function data and 
output enable for VRAM 
bank 

Special function output 
for VRAM bank 

(continued on next page) 



2-4 DECchip 21071-CA Pin Descriptions 



Table 2-1 (Cont.) DECchip 21071-CA Pin List 









Buffer 




Signal Name 


Quantity 


Type 


Strength 


Function 


Miscellaneous/Clock Signals 








(8 Total) 










wideMem 


1 


1 


- 


If true, indicates 128-bit 
wide memory 


clklx2 






- 


Clocl< input 


clk2ref 






- 


Piiase reference for 
cll<lx2 


reset 1 






- 


Reset 


testM ode 






- 


Test mode select 


scanEnable 






- 


Scan enables 


tristatej 






- 


Tri states all output and 
bidirectional pins 


pTestout 


1 





4 ma 


Parametric NAND tree 
output 



Pin Totals 

Total input pins: 

Total output pins: 

Total signal pins: 

Total power and ground pins: 

Total pins: 



56 
114 
170 

36 

206 



2.2 DECchip 21071-CA Signal Descriptions 

This section provides pin signal information, including a description of the 
signal, the clock edge on which the signal changes, and rules about signal 
usage during various sysBus transactions. 

For simplicity, the signal sysclkOutl_h will be treated as clklR. See 
Section 6.2.1 for more information about the clocks on the 21071-CA chip. 

Signal descriptions are grouped by function and correspond to the pin list 
(Table 2-1). 

Note 



The Alpha 21064 microprocessor does not useclklR; rather, it uses 
sysClkOut_h to generate and sample signals. 
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2.2.1 CPU/Bcache Signals 

This section describes the CPU/Bcache signals. 

2.2.1.1 sysData<15:0> 

Signal Type: Bidirectional - (21071-BA, CPU, Bcache, 21071-CA) 
Input Sampling Clock Edge: clk2F 
Output Clock Edge: clklR 

sysData<15:0> is a bidirectional bus that provides data to and from the 
DECchip 21071-CA chip and the CPU. The default driver of sysData<15:0> 
is the CPU. 

sysData<15:0> is used to read and write the CSR data for the 21071-CA chip. 
The 21071-CA chip does not support error checking on its CSR transactions, 
so corresponding sysCheck signals do not go to the 21071-CA. On a CSR read 
transaction, the 21071-CA chip drives sysData<15:0>. The rest of the bits are 
driven by the 21071-BA data chips. 

2.2.1.2 sysAdr<33:5> 

Signal Type: 21071-CA Input, CPU output, 21071-DA bidirectional 
Input Sampling Clock Edge: Latch closes on clklR or clklF 
Output Clock Edge: clklR 

sysAdr<33:5> contains the cache line address of sysBus transactions. 
sysAdr<33:32> indicates the address quadrant. 

sysAdr<33:5> is driven by the CPU on CPU-initiated transactions and by the 
21071-DA chip on DMA transactions. 

• On CPU-initiated transactions, the 21071-CA chip latch opens when 
cpuCReq<2:0> becomes non-idle and closes on the next clklR. 

• On DMA transactions, the 21071-CA chip latch opens when DMA owns the 
sysBus and closes on the clklF which is 1.5 cycles after the 21071-DA chip 
has driven the address. 

2.2.1.3 tagAdr<31:17> 

Signal Type: Bidirectional (21071-CA, Bcache), CPU input 
Input Sampling Clock Edge: clklF 
Output Clock Edge: clklF 

tagAdr<31:17> carries Bcache tag information. The only addresses that 
are cached are those with sysAdr<33:32> = 00. Bits <33:32> of the tag are 
assumed to be 00. 



2-6 DECchip 21071-CA Pin Descriptions 



The tagAdr<33:32> pins of the DECchip 21064 microprocessor should be tied 
to 00, and only bits <31:17>are variable. The number of significant bits of 
the tag depends on the depth of the Bcache RAMs and the maximum memory 
capacity of the system. 

On a Bcache miss transaction, the tag address is driven ontotagAdr<31:17>by 
the 21071-CA chip and written into the tag data store. 

tagAdr<31:17> is read by the processor during a cache probe. The processor 
does not drive these signals at any time. 

The Bcache tag store drives tagAdr<31:17> with the assertion of sysE arlyOE E n, 
supplied by the 21071-CA chip on CPU read block, CPU write block, CPU, 
LDx_L, and CPU STx_C transactions. On DMA transactions, the Bcache tag 
store drives tagAdr<31:17> when the 21071-CA chip asserts sysTagOEEn. 

Unused tagAdr bits should be pulled down on the module. 

2.2.1.4 tagAdrP 

Signal Type: Bidirectional (21071-CA, CPU, Bcache) 
Input Sampling Clock Edge: clklF 
Output Clock Edge: clklF 

tagAdrP is an even parity bit over the significant bits of tagAdr<33:17>. The 
number of bits that participate in the parity computation depend on the size of 
the Bcache. 

2.2.1.5 tagCtIV 

Signal Type: Bidirectional (21071-CA, CPU, Bcache) 
Input Sampling Clock Edge: clklF 
Output Clock Edge: clklF 

tagCtIV indicates that the cache entry is valid. The 21071-CA chip sets this bit 
during cache fills and clears this bit during DMA writes that hit in the cache. 

2.2.1.6 tagCtID 

Signal Type: Bidirectional (21071-CA, CPU, Bcache) 
Input Sampling Clock Edge: clklF 
Output Clock Edge: clklF 

tagCtID indicates that the cache entry is dirty. The 21071-CA chip sets this bit 
during write allocate cache fills. The processor sets this bit during CPU writes 
that hit in the Bcache. 
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2.2.1.7 tagCtIP 

Signal Type: Bidirectional (21071-CA, CPU, Bcache) 
Input Sampling Clock Edge: clklF 
Output Clock Edge: clklF 

tagCtIP is an even parity bit over tagCtIV and tagCtlD. 

2.2.1.8 cpuCWMask<7:0> 

Signal Type: 21071-CA Input 

Signal Source: CPU 

Input Sampling Clock Edge: clklR and clklF 

cpuCWMask<7:0> is used on CPU-initiated read block and write block 
transactions. These signals carry different information on these transactions. 

• On CPU write block and 5Tx_C transactions, these signals carry the 
longword mask for the whole cache line. An asserted cpuCWMask signal 
indicates that the corresponding longword from the cache line is valid and 
should be written. 

Any combination of mask bits is allowed on cpuCWMask<7:0> during 
a CPU write block transaction. CPU STx_C transactions can only have 
combinations that correspond to a single quadword or longword. 

• On CPU read block and LDx_L transactions, the cpuCWMask<7:0> signals 
carry additional information about the read transaction. cpuCWMask<l:0> 
carries address bits <4:3>, which indicate the address of the actual 
quadword that missed. 

This information can be used to implement quadword granularity to I/O 
space, as well as to provide wrapping in memory space. cpuCWMask<2> 
indicates the type of read reference. cpuCWMask<2>is true if the miss is 
a Dstream reference, and it is false if the miss is an I stream reference. 
cpuCWMask<6> is ignored, but it contains longword or quadword 
information on LDxL transactions in the Alpha 21064 microprocessor. 

2.2.1.9 cpuCReq<2:0> 

Signal Type: 21071-CA Input 

Signal Source: CPU 

Input Sampling Clock Edge: clklF 
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Whenever the processor wants to initiate an external transaction, it puts a 
transaction type code onto cpuCReq<2:0>. Table 2-2 lists the encodings for the 
different transaction types. 

Table 2-2 CPU-Initiated Transaction Encodings 



cpuCReq<2:0> 


Transaction 


000 


Idle 


001 


Barrier 


010 


Fetch 


on 


FetchM 


100 


Read block 


101 


Write block 


no 


LDx L 


111 


STx C 



The transaction types are held on cpuCReq<2:0> until the end of the 
transaction; therefore, there is no need to latch these signals. 

Transactions on cpuCReq<2:0>are ignored by the 21071-CA and 21071-DA 
chips when the bus is granted to the 21071-DA chip for DMA transactions. 
cpuCReq<2:0>are ignored from the cycle that cpuHoldReq was asserted by the 
21071-CA through the cycle after cpuHoldAck is deasserted at the end of the 
DMA transaction. 



2.2.1.10 cpuCAck<2:0> 



Signal Type: 21071-CA Output 
Signal Destination: CPU 
Output Clock Edge: clklR 

The 21071-CA chip provides transaction acknowledge information to the 
CPU on cpuCAck<2:0>. The 21071-CA chip is the only driver of these 
signals. On CPU-initiated transactions addressed to the 21071-DA or the 
PCI, the 21071-CA chip receives transaction acknowledge information from the 
21071-DA chip on ioCmd<2:0>and forwards it to the CPU on cpuCAck<2:0>in 
the following cycle. 

Table 2-3 lists the encodings for cpuCAck<2:0>. 
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Table 2-3 cpuCAck Encodings 



cpuCAck<2:0> 



Acknowledge 



Description 



000 
001 
010 

on 

100 



Idle 




Hard 1 


zrror 


Soft_E 


rror 


STx C 


Fail 


OK 





Transaction failed in a catastropliic manner. 
A failure occurred in tlie transaction, but was 
corrected. (Not used.) 
CPU STx_C transaction failed. 
Transaction completed successfully. 



2.2.1.11 cpuDRAck<2:0> 

Signal Type: 21071-CA Output 
Signal Destination: CPU 
Output Clock Edge: clklR 

The 21071-CA chip indicates to the CPU that valid read data is on the sysBus, 
indicates whether the data should be cached and indicates whether ECC 
checking and correction or parity checking should be performed. Table 2-4 lists 
the encodings of cpuDRAck<2:0>. 

The 21071-CA chip is the only driver of these signals. On CPU-initiated 
transactions addressed to the 21071-DA chip or the PCI , the 21071-CA chip 
receives transaction acknowledge information from the 21071-DA chip on 
ioCmd<2:0> and forwards it to the CPU on cpuDRAck<2:0> in the following 
cycle. 

Table 2-4 cpuDRAck Encodings 



cpuDRAck<2:0> Acknowledge 



Description 



2.2.1.12 



000 


idli 


e 


100 


ok 


NCache NChk 


101 


ok. 


_N Cache 


no 


ok 


NChk 


111 


ok 




cpuDWSel<1> 







Data valid, don't cache, don't check. 

Data valid, don't cache, check ECC or parity. 

(Not used.) 

Data valid, cache, don't check. (Not used.) 

Data valid, cache, check ECC or parity. 



Signal Type: 21071-CA Output 
Signal Destination: CPU 
Output Clock Edge: clklR 

During a CPU write, the 21071-CA chip uses cpuDWSel<l> to indicate to the 
processor which data word should be driven on the sysBus. 
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When the CPU owns the sysBus, cpuDWSel<l> is asserted to the CPU as 
soon as the 21071-CA chip has decoded a write blocl< or STx_C command on 
cpuCReq<2:0>. Once the high octaword of CPU data has been loaded into the 
21071-BA chips, cpuDWSel<l>is deasserted. 

Note 



The 21071-CA chip controls the rate at which CPU write data is 
available on the sysBus with cpuDWSel<l>. The 21071-DA chip (I/O 
bridge) is always capable of accepting all the data on a CPU-initiated 
I/O write transaction on the sysBus. The I/O write can be stalled on the 
sysBus by delaying cpuCAcl«2:0> after all the data has been latched. 



2.2.1.13 cpuDlnvReq 



Signal Type: 21071-CA Output 
Signal Destination: CPU 
Output Clock Edge: cll<lR 

The 21071-CA chip asserts cpuDlnvReq when it needs to invalidate an entry 
in the CPU internal Dcache. The signal is asserted while the index to the 
Dcache is stable on the I Adr<12:5> pins (a buffered or unbuffered version of 
sysAdr<12:5>) of the CPU. This signal should be tied to the dInvReq pins of 
theCPU. 



2.2.1.14 cpuHoldReq 



Signal Type: 21071-CA Output 
Signal Destination: CPU 
Output Clock Edge: clklR 

The 21071-CA chip asserts cpuHoldReq to get ownership of the Bcache 
when the 21071-DA chip has won arbitration for the sysBus. If an external 
transaction is present on the sysBus, cpuHoldReq is asserted at the end of that 
transaction. If the bus is idle or if the 21071-DA chip is requesting preemption, 
cpuHoldReq is asserted right away. 
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2.2.1.15 cpuHoldAck 

Signal Type: 21071-CA Input 

Signal Source: CPU 

Input Sampling Clock Edge: clklF 

The processor asserts cpuHoldAck to indicate that it has given up control of 
the cache to the 21071-CA chip. The minimum delay from the assertion of 
cpuHoldReq to the assertion of cpuHoldAck is two sysBus cycles. 

The deassertion of cpuHoldReq causes cpuHoldAck to deassert in one sysBus 
cycle. When the processor asserts cpuHoldAck, it will have turned off its 
external drivers on or before clklR. When the processor deasserts cpuHoldAck, 
it does not turn on its drivers for two CPU cycles after clklR. 

2.2.2 Bcache/PAL Control Signals 

This section describes the Bcache/PAL control signals. 

2.2.2.1 sysEarlyOEEn 

Signal Type: 21071-CA Output 
Signal Destination: Bcache PAL 
Output Clock Edge: clklR 

sysEarlyOEEn is asserted during sysBus idle cycles to allow CPU data bus 
drivers, as well as data and tag RAM output enables to be asserted from the 
PALs as quickly as possible when the CPU asserts cpuCReq<2:0>. 

sysEarlyOEEn is asserted on clklR in the idle cycle where cpuCReq<2:0> may 
be asserted. When sysEarlyOEEn is asserted, cpuCReq<2:0>will cause various 
outputs to assert, as shown in Table 2-5. 

Table 2-5 sysEarlyOEEn Effect on bcTagOEJ and bcDataOEJ 



cpuCReq<2:0> 


Command 


bcTagOEJ 


bcDataOEJ 


cpuDOEJ 


000 


Idle 


F 


F 




001 


Barrier 


T 


T 




010 


Fetch 


T 


T 




on 


FetchM 


T 


T 




100 


Read block 


T 


T 




101 


Write block 


T 


F 




no 


LDx L 


T 


T 




111 


STx C 


T 


F 
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2.2.2.2 sysTagOEEn 



Signal Type: 21071-CA Output 
Signal Destination: Bcache PAL 
Output Clock Edge: clklF or clklR 

sysTagOEEn is asserted by the 21071-CA chip during DIMA transactions after 
the processor has given ownership of the cache by asserting cpuHoldAck. 
sysTagOEEn is also asserted during CPU-initiated, non-cacheable transactions 
to avoid long tristate times on tagAdr<31:17>and sysTagCtl. 

sysTagOEEn is asserted on clklR in the first cycle of a DMA transaction (the 
cycle when ioCmd<2:0>is driven). During all other cycles it is asserted and 
deasserted on clklF. 

2.2.2.3 sysDataOEEn 

Signal Type: 21071-CA Output 
Signal Destination: Bcache PAL 
Output Clock Edge: clklF, clk2F or clklR 

sysDataOEEn is asserted by the 21071-CA chip whenever it needs to read data 
from the Bcache. This occurs during a victim read, during an LDx_L or STx_C 
transaction that hits in the cache, and during all DMA transactions, because 
the data cache is never written during DMA. 

sysDataOEEn is asserted on clklR in the first cycle of a DMA transaction (the 
cycle when ioCmd<2:0> is driven). 

sysDataOEEn is asserted on clklF in the cycle before CPU Write Allocate 
Victim data is read from the cache. (This makes the access time path from the 
SRAM output enable 174 cycles.) 

In all other cases sysDataOEEn is asserted and deasserted on clk2F. 

2.2.2.4 sysDataALEn 

Signal Type: 21071-CA Output 
Signal Destination: Bcache PAL 
Output Clock Edge: clk2R 
Input Sampling Clock Edge: clk2F 

sysDataALEn and sysDataAHEn are sent to the PAL to generate the lower 
address bit for the Bcache data RAMs. The lower address bit must be toggled 
to the Bcache during cache fills, victim reads, reads that hit the cache, and 
during LDx_L and STx_C hits. 

The PAL receives the sysDataALEn signal to enable bcDataA<4> for the period 
when clk2 is low. 
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2.2.2.5 sysDataAHEn 



Signal Type: 21071-CA Output 
Signal Destination: Bcache PAL 
Output Clock Edge: clk2F 
Input Sampling Clock Edge: clk2R 

sysDataALEn and sysDataAHEn are sent to the PAL to generate the lower 
address bit for the Bcache data RAMs. The lower address bit must be toggled 
to the Bcache during cache fills, victim reads, reads that hit the cache, and 
during LDx_L and STx_C hits. 

The PAL receives the sysDataAHEn signal to enable bcDataA<4> for the period 
when clk2 is high. 



2.2.2.6 sysTagWE 



Signal Type: 21071-CA Output 
Signal Destination: Bcache PAL 
Output Clock Edge: clklR 
Input Sampling Clock Edge: RAM WE 

This signal is asserted when a write to the tag address and control cache RAMs 
is needed. sysTagWE is NORed with the CPU write enable pulse to generate 
the tag control write enable. sysTagWE is inverted to generate the tag address 
write enable. 



2.2.2.7 sysDataWEEn 



Signal Type: 21071-CA Output 
Signal Destination: Bcache PAL 
Output Clock Edge: clklR 
Input Sampling Clock Edge: clklF 

This signal is asserted to the PALs when a write to the data cache RAMs is 
needed. sysDataWEEn is used if the system is performing short writes. The 
actual write enable pulse is generated by the PAL by ANDing sysDataWEEn 
with an inverted clkl signal. It is then NORed with the CPU write enable 
signal to generate the data RAM write enable. 
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2.2.2.8 sysDataLongWE 

Signal Type: 21071-CA Output 
Signal Destination: Bcache PAL 
Output Clock Edge: clklF 
Input Sampling Clock Edge: RAM WE 

This signal is asserted to the PALs when a write to the data cache RAIM s is 
needed. sysDataLongWE is used if the system is doing long writes. The write 
enable pulse is NORed with the CPU write enable pulse to generate the data 
RAM write enable. 

2.2.2.9 sysDOE 

Signal Type: 21071-CA Output 

Signal Destination: PAL 

Output Clock Edge: clklR 

Input Sampling Clock Edge: Flow through 

sysDOE enables the processor data output enable during CPU external write 
cycles. sysDOE flows through the PAL and causes cpuDOE_l to assert. 

2.2.3 PCI Bridge Interface Signals 

This section describes the PCI bridge interface signals. 

2.2.3.1 ioRequest<1 :0> 

Signal Type: 21071-CA Input 
Signal Source: 21071-DA 
Input Sampling Clock Edge: clklF 
Output Clock Edge: clklR 

The 21071-DA chip asserts ioRequest<l:0>to request ownership of the sysAdr 
lines to perform a DMA transaction. ioRequest<l:0> is acknowledged using 
ioGrant. 

A request may be asserted for three cycles before the bus is actually required, 
because three cycles are required to acquire ownership of the Bcache from the 
CPU. When a DMA transaction is started, ioRequest<l:0> should be returned 
to idle in the same cycle as ioCmd<2:0> if no further DMA transactions are 
required. Table 2-6 lists the encodings for ioRequest<l:0>. 
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Table 2-6 ioRequest<1:0> Encodings 



ioRequest<1 :0> 


Function 


00 


Idle 


01 


DMA preempt request 


10 


DMA request 


11 


DMA atomic request 



When the 21071-DA chip uses the DIMA request encoding, the bus arbiter 
determines who will get the bus based on which node currently has the bus 
and programmed priority. 

The 21071-DA chip uses the DMA atomic request encoding when it needs 
to do multiple DMA transactions on the sysBus without the intervention of 
transactions from the CPU. For the first transaction, the 21071-DA chip uses 
the DMA request encoding. After that request has been granted, the 21071-DA 
chip changes ioRequest<l:0>to the DMA atomic request encoding. 

Assertion of a DMA preempt request should be done only during memory 
barriers or to avoid deadlocks when the CPU owns the sysBus and is 
addressing the 21071-DA chip address space. A preempt request forces the 
21071-DA chip to win arbitration and causes the 21071-CA chip to assert 
cpuHoldReq in the middle of the CPU transaction. The 21071-DA chip can 
keep DMA preempt request asserted for consecutive DMA transactions. 
For example, when a CPU request needs to be preempted by a DMA write 
transaction to flush the DMA write buffer, the 21071-DA chip should keep a 
DMA preempt request asserted through the entire flush of the buffer until all 
DMA write transactions have been completed. 

2.2.3.2 ioGrant 

Signal Type: 21071-CA Output 
Signal Destination: 21071-DA 
Output Clock Edge: clklR 
Input Sampling Clock Edge: clklF 

The 21071-CA chip indicates to the 21071-DA chip that it has won ownership of 
the sysBus by asserting ioGrant in response to ioRequest<l:0>. On assertion of 
ioGrant, the 21071-DA chip must not begin any new CPU transactions. When 
ioGrant and cpuHoldAck are both asserted, the 21071-DA chip may begin a 
new DMA transaction. If the 21071-DA chip samples ioGrant as deasserted 
in any cycle, its sysAdr drivers must be tristated on the next clklR. The 
21071-DA chip uses the ioGrant in combination with cpuHoldAck to determine 
if cpuCReq<2:0> should be ignored. 

2-16 DECchip 21071-CA Pin Descriptions 



2.2.3.3 ioCmd<2:0> 



Signal Type: 21071-CA Input 
Signal Source: 21071-DA 
Input Sampling Clock Edge: clklF 
Output Clock Edge: clklR 

The 21071-DA chip asserts ioCmd<2:0>to request an action by the 21071-CA 
chip. When the 21071-DA chip has the sysBus, ioCmd<2:0> is used to request 
a bus transaction. 

When the CPU has the bus, ioCmd<2:0> is used to request assertion of the 
cpuCAck<2:0> and cpuDRAck<2:0> signals. 

Note 



There is no encoding for cpuDRAck<2:0>ok_NChk. The 21071-DA chip 
never returns cacheable, non-checkable read data. 



A cpuCAck<2:0>or cpuDRAck<2:0> request must not be sent during DMA, one 
cycle after the 21071-CA chip sends ioGrant, or one cycle after the 21071-DA 
chip requests a preempt. Table 2-7 lists the encodings for ioCmd<2:0>. 



Table 2-7 ioCmd<2:0> Encodings 



ioCmd<2:0> 


CPU Owns sysBus 


21071-DA Owns sysBus 


000 


Idle 


Idle 


001 


CIrLock 


Flush 


010 


cpuDRAck ok N Cache NChk 


Write 


on 


cpuDRAck ok N Cache 


Write masked 


100 


cpuCAck ok 


Read 


101 


cpuCAck Hard Error 


Read burst 


110 


cpuCAck Soft_Error 


Read wrapped 


111 


cpuCAck STxC_Fail 


Read burst wrapped 
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2.2.3.4 ioCAck<1:0> 



Signal Type: 21071-CA Output 
Signal Destination: 21071-DA 
Input Sampling Clock Edge: clklF 
Output Clock Edge: clklR 

The 21071-CA chip asserts ioCAck<l:0>to acknowledge a DMA transaction. 
ioCAck<l:0> indicates that the DIMA transaction has been completed. If any 
error occurs during the transaction, an error response is sent. Table 2-8 lists 
the encodings for ioCAck<l:0>. 

Table 2-8 ioCAck<1:0> Encodings 
ioCAck<1 :0> Function 



00 


Idle 


01 


Reserved/unused 


10 


DMA cycle acknowledge 


11 


DMA cycle error 



2.2.3.5 ioDataRdy 



Signal Type: 21071-CA Output 
Signal Destination: 21071-DA 
Input Sampling Clock Edge: clklF 
Output Clock Edge: clklR 

During any DMA read, IoDataRdy is asserted when read data is ready on 
the sysBus. IoDataRdy is used by the 21071-DA chip to get an early start 
on getting read data from the DMA read buffer without having to wait for 
ioCAck<l:0>. When the 21071-DA chip receives IoDataRdy, data will be 
available on epiData<31:0>in the next cycle. 

Note 



The number of ioDataRdy assertions may not correspond to the number 
of octawords loaded into the DMA read buffer. The 21071-DA chip 
must ignore IoDataRdy if a DMA read is not in progress. 



When the 21071-DA chip receives ioCAck<l:0>, the entire cache block is 
available in the DMA read buffer. The data may be read out on epiData<31:0> 
two cycles after acknowledge of ioCAck<l:0> is received. (See Figure 16-1.) 
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2.2.4 Data Path Control Signals 

This section describes the data path control signals. 

2.2.4.1 drvSysData 

Signal Type: 21071-CA Output 
Signal Destination: 21071-BA 

Output Clock Edge: clk2R assertion, clk2F deassertion 

Input Sampling Clock Edge: clklR assertion, clklF deassertion. 

drvSysData is asserted by the 21071-CA chip to indicate that the 21071-BA 
chip should drive sysData and sysCheck on the next clklR. When deasserted, 
drvSysData indicates to the 21071-BA chip that it should tristate the sysBus 
on the next clklF. 

2.2.4.2 drvSysCSR 

Signal Type: 21071-CA Output 
Signal Destination: 21071-BA 
Output Clock Edge: clk2R 
Input Sampling Clock Edge: clklR 

drvSysCSR is asserted by the 21071-CA chip to indicate that the 21071-CA 
chip is driving sysData<15:0> on the next clklR, and that the lower order 
21071-BA chip should not drive these lines. 

The drvSysCSR signal is normally deasserted, except during CSR reads. When 
drvSysData is asserted and drvSysCSR is not asserted, the 21071-BA chips 
will drive all sysData<127:0> lines. 

On a CSR read to the 21071-CA chip, both drvSysData and drvSysCSR are 
asserted. This will result in the 21071-BA chips driving sysData<127:16>and 
the 21071-CA chip driving sysData<15:0>. 

2.2.4.3 drvMemData 

Signal Type: 21071-CA Output 

Signal Destination: 21071-BA 

Input Sampling Clock Edge: Flow through 

Output Clock Edge: memClkR 

drvMemData is asserted by the 21071-CA chip to indicate that the 21071-BA 
chips should drive memData on the next memClkR. 
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2.2.4.4 syslORead 



Signal Type: 21071-CA Output 
Signal Destination: 21071-BA 
Output Clock Edge: clklR 
Input Sampling Clock Edge: clk2F 

syslORead is asserted by the 21071-CA chip and drvSysData to indicate that 
the contents of the I/O read buffer should be driven onto the sysBus. 

2.2.4.5 sysReadOW 

Signal Type: 21071-CA Output 
Signal Destination: 21071-BA 
Input Sampling Clock Edge: clk2F 
Output Clock Edge: clklR 

sysReadOW is asserted by the 21071-CA chip to indicate to the 21071-BA chips 
that the upper octaword of data should be taken from the memory read, merge, 
and I/O read buffers. 

2.2.4.6 subCmdA<1 :0>, subCmdB<1 :0>, subCmdCommon 

Signal Type: 21071-DA Output 
Signal Destination: 21071-BA 
Output Clock Edge: clklR 
Input Sampling Clock Edge: clk2F 

The subCmd signals are asserted to further qualify the sysCmd<2:0> signals 
(Table 2-10). Table 2-9 describes how to connect the various subCmd pins 
from the 21071-CA chip to the 21071-BA chips. 



Table 2-9 SubCmd Connections 



21071-BA Pin, 64-Bit IVIemory 
21071-CA Pin DECcliip 21071 Configuration 



21071-BA Pin, 128-Bit IVIemory 
DECcliip 21072 Configuration 



subCmdA<0> 
subCmdA<l> 
subCmdB<0> 
subCmdB<l> 
subCmdCommon 



21071-BA subCmd<0> 
21071-BA subCmd<l> 
21071-BA 1 subCmd<0> 
21071-BA 1 subCmd<l> 
Not applicable 



21071-BA subCmd<0> 
21071-BA 2 subCmd<0> 
21071-BA 1 subCmd<0> 
21071-BA 3 subCmd<0> 
21071-BA 0-3 subCmd<l> 
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2.2.4.7 sysCmd<2:0> 



Signal Type: 21071-CA Output 
Signal Destination: 21071-BA 
Output Clock Edge: clklR 
Input Sampling Clock Edge: clk2F 

The sysCmd<2:0> signals, in combination with the subCmcl<l:0> signals 
indicate to the 21071-BA chip the action to tal<e on the sysData bus. I n 
general, they echo the actions tal<ing place on the sysBus during the previous 
cycle. The bits are decoded into various actions based on the information in the 
following table. 



Table 2-10 sysCmd<2:0> and subCmd<1 :0> Encodings 



sysCmd subCmd Mnemonic 



Function 



000 



000 



001 



010 



on 



ox 



IX 



XX 



XX 



XX 



RESET 



NOP 



LOAD 



RDDMAS 
WRIO 



RDDMAM 



The merge bits in the merge buffer are cleared. 
All sysBus counters are reset. The data in the 
pad latches is held (to save power). 

The data in the pad latches is held in the 
latches, and new data will not be clocked into 
them. Used during reads or to hold the first 
transfer of write data when the write buffer is 
full. 

No write action is performed. Sent when 
waiting for write data to be ready. Data from 
the sysData bus is loaded into the pad flops. 

Data in the sysData pad latches is loaded into 
the DMA read buffer, which also serves as the 
I/O write buffer. A counter is incremented so 
that the next RDDMAS will load data into the 
next sub-cache line of the buffer. 

Data in the memory read buffer is loaded into 
the DMA read buffer. A counter is incremented 
so that the next RDDMAM will load data into 
the next sub-cache line of the buffer. 



(continued on next page) 
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Table 2-10 (Cont.) sysCmd<2:0> and subCmd<1 :0> Encodings 



sysCmd subCmd Mnemonic 



Function 



100 00 MERGEOO Nothing is loaded into the merge buffer. A 

counter is incremented so that the next 
MERGEnn will load data into the next sub- 
cache line of the buffer. 

During STx_C transactions that hit in the 
cache, each sub-cache line of the merge buffer 
is loaded twice: once with the CPU write 
data using MERGE (that is, MERGEOl) and 
once with the cache data using MERGE with 
inverted enables, called an overlay (that is, 
OVLYIO). 

100 01 MERGEOl Same as MERGEOO, except I ongword in 

the sysData pad latches is loaded into the 
read/merge buffer, and the merge bit that 
corresponds to longword is set. 

100 10 MERGEIO Same as MERGEOO, except longword in 

the sysData pad latches is loaded into the 
read/merge buffer, and the merge bit that 
corresponds to longword 1 is set. 

100 11 MERGE 11 Same as MERGEOO, except longwords and 

1 in the sysData pad latches are loaded into 
the read/merge buffer, and the merge bits that 
correspond to longwords and 1 are set. 



101 00 



101 01 



101 10 



WR5YS0 



WRSYSl 



WRSYS2 



Data in the sysData pad latches is loaded into 
the memory write buffer, which represents 
cache line 0. A counter is incremented so that 
the next WRSYSO will load data into the next 
sub-cache line of cache line 0. 

Data in the sysData pad latches is loaded into 
the memory write buffer, which represents 
cache line 1. A counter is incremented so that 
the next WRSYSl will load data into the next 
sub-cache line of cache line 1. 

Data in the sysData pad latches is loaded into 
the memory write buffer, which represents 
cache line 2. A counter is incremented so that 
the next WRSYS2 will load data into the next 
sub-cache line of cache line 2. 

(continued on next page) 
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Table 2-10 (Cont.) sysCmd<2:0> and subCmd<1 :0> Encodings 



sysCmd subCmd Mnemonic 



Function 



101 11 WRSYS3 Data in tlie sysData pad iatclies is loaded into 

tlie memory write buffer, wliicli represents 
caclie line 3. A counter is incremented so that 
the next WRSYS3 will load data into the next 
sub-cache line of cache line 3. 



110 00 WRDM ASO Data in the sysData pad latches is merged with 

the DMA write buffers and loaded into the 
memory write buffer, which represents cache 
line 0. A counter is incremented so that the 
next WRDMASO will load data into the next 
sub-cache line of cache line 0. 

110 01 WRDMASl Data in the sysData pad latches is merged with 

the DMA write buffers and loaded into the 
memory write buffer, which represents cache 
line 1. A counter is incremented so that the 
next WRDMASl will load data into the next 
sub-cache line of cache line 1. 

110 10 WRDMAS2 Data in the sysData pad latches is merged with 

the DMA write buffers and loaded into the 
memory write buffer, which represents cache 
line 2. A counter is incremented so that the 
next WRDMAS2 will load data into the next 
sub-cache line of cache line 2. 

110 11 WRDMAS3 Data in the sysData pad latches is merged with 

the DMA write buffers and loaded into the 
memory write buffer, which represents cache 
line 3. A counter is incremented so that the 
next WRDMAS3 will load data into the next 
sub-cache line of cache line 3. 



Ill 00 WRDMAMO Data in the memory read buffer is merged with 

the DMA write buffers and loaded into the 
memory write buffer, which represents cache 
line 0. A counter is incremented so that the 
next WRDMAMO will load data into the next 
sub-cache line of cache line 0. 

(continued on next page) 
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Table 2-10 (Cont.) sysCmd<2:0> and subCmd<1 :0> Encodings 



sysCmd subCmd Mnemonic 



Function 



111 01 WRDMAMl Data in tlie manory read buffer is merged witli 

tlie DMA write buffers and loaded intotiie 
memory write buffer, wliicli represents caciie 
line 1. A counter is incremented so that the 
next WRDMAM 1 will load data into the next 
sub-cache line of cache line 1. 

Ill 10 WRDMAM 2 Data in the memory read buffer is merged with 

the DMA write buffers and loaded into the 
memory write buffer, which represents cache 
line 2. A counter is incremented so that the 
next WRDMAM 2 will load data into the next 
sub-cache line of cache line 2. 

Ill 11 WRDMAM 3 Data in the memory read buffer is merged with 

the DMA write buffers and loaded into the 
memory write buffer, which represents cache 
line 3. A counter is incremented so that the 
next WRDMAM 3 will load data into the next 
sub-cache line of cache line 3. 



2.2.4.8 memCmd<3:1> 



Signal Type: 21071-CA Output 
Signal Destination: 21071-BA 
Output Clock Edge: clk2R 
Input Sampling Clock Edge: clklR 

The memCmd<3:l> signals indicate to the 21071-BA chips the action to take 
on the memData bus. memCmd<3:l> is driven by the 21071-CA chip on clk2R 
and latched by the 21071-BA chip on clklR. 

The bits are decoded into various actions. Table 2-11 provides a complete 
description of the memCmd<3:l> encodings. 
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Table 2-11 nnemCmd<3:1> Encodings 



memCmd 



Mnemonic 



Function 



010 
Oil 



NOP 
RESET 



No operation. 

All memory pointers in tlie 21071-BA chip are 
reset. 



000 



001 



RDIMM 



RDDLY 



Read data is loaded into the read/merge 
buffer on the next memClkR. A counter is 
incremented so that the next RDxxx will load 
data into the next available sub-cache line of 
the read buffer. 

Read data is loaded into the read/merge buffer 
on the memClkR after the next memClkR. A 
counter is incremented so that the next RDxxx 
will load data into the next available sub-cache 
line of the read buffer. 



100 



101 



WRIMM 



WRDLY 



Data from the memory write buffer is driven 
to memory on the next memClkR. A counter is 
incremented so that the next WRxxx will drive 
the next sub-cache line to memory. 

Data from the memory write buffer is driven 
to memory on the memClkR after the next 
memClkR. A counter is incremented so that the 
next WRxxx will drive the next sub-cache line 
to memory. 



110 



WRIMML 



111 



WRDLYL 



Data from the memory write buffer is driven 
to memory on the next memClkR. After the 
write, the quadword pointer is reset to 0, and 
the cache line pointer is incremented so that 
the next WRxxx will drive the first sub-cache 
I i ne of the next cache I i ne to memory. 

Data from the memory write buffer is driven 
to memory on the memClkR after the next 
memClkR. After the write, the quadword 
pointer is reset to 0, and the cache line pointer 
is incremented so that the next WRxxx will 
drive the first sub-cache line of the next line to 
memory. 
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2.2.5 Memory Signals 

This section describes the memory signals. 

2.2.5.1 memAdr<11:0> 

Signal Type: 21071-CA Output 
Signal Destination: IMemory 
Output Clock Edge: memCII<R 

memAdr<ll:0> is the time multiplexed address bus that provides the row and 
column addresses to the memory. 

2.2.5.2 memRAS_l<8:0> 

Signal Type: 21071-CA Output 

Signal Destination: Memory 

Output Clock Edge: memClkR (Programmable) 

memRAS_l<8:0> is asserted on memory read or write transactions and 
video serial register loads to indicate the presence of a valid row address 
on memAdr<ll:0>. Each memRAS_l<8:0> signal corresponds to one of the nine 
banksets as determined by the memory address decode logic. memRAS_l<8:0> 
is asserted on memory reads and writes only if the subbank number is 0, or 
if subbanks for that bank are disabled (Bx_SU BEN A=0). On memory refresh 
transactions, memRAS_l<8:0>is asserted. 

2.2.5.3 memRASB_l<8:0> 

Signal Type: 21071-CA Output 

Signal Destination: Memory 

Output Clock Edge: memClkR (Programmable) 

memRASB_l<8:0> functions similarly to the memRAS_l<8:0> signals, except 
that memRASB_l<8:0> is asserted on memory reads and writes only if the 
subbank number is 1. If subbanks for that bank aredisabled (Bx_SUBENA=0), 
the memRASB_l line of that bank will assert only for refreshes. 

2.2.5.4 memCAS_l<3:0> 

Signal Type:21071-CA Output 

Signal Destination: Memory 

Output Clock Edge: memClkR (Programmable) 
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memCAS_l<3:0> signals are used during memory reads and writes to indicate 
that a valid column address is on memAdr<ll:0>. During memory writes, 
memCAS_l<3:0> asserts if the respective memory longwords are being written. 
On memory reads, all memCAS_l bits are asserted. memCAS_l<3:0> is also 
asserted during refreshes and video serial register loads. 



2.2.5.5 memWE l<1:0> 



Signal Type: 21071-CA Output 

Signal Destination: Memory 

Output Clock Edge: memClkR (Programmable) 

memWE_l<l:0> signals are asserted on a memory write transaction to indicate 
that valid write data is present on the memData outputs. memWE_l<0>and 
memWE_l<l>are identical copies provided to reduce loading. 



2.2.5.6 memPDCIk 



Signal Type: 21071-CA Output 

Signal Destination: Presence Detect Shift Register 

Output Clock Edge: clk2R 

memPDCIk provides a clock at one-fourth the clkl frequency. This clock 
is connected to the presence detect shift registers. memPDLoad_l and the 
sampling of memPDDIn are referenced to memPDCIk. memPDCIk starts as 
soon as reset_l is deasserted, and discontinues after all data has been shifted 
into the presence detect Control Status Registers (CSRs). 



2.2.5.7 memPDLoad I 



Signal Type: 21071-CA Output 

Signal Destination: Presence Detect Shift Register 

Output Clock Edge: clk2R 

memPDLoad_l asserts to indicate that the presence detect pins should be 
loaded into the presence detect shift register. When memPDLoad_l is asserted, 
at least one memPDCIk will occur. This enables the use of either asynchronous 
or synchronous loading shift registers. 
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2.2.5.8 memPDDIn 

Signal Type: 21071-CA Input 

Signal Source: Presence Detect Shift Register 

Input Clock Edge: cll<2R 

Tine memPDDIn signal contains the data from the presence detect shift 
register. The value of memPDDI n is shifted into the 21071-CA chip presence 
detect registers one sysClock after memPDCIk deasserts (which is three 
sysClocks after memPDCIk asserts). The data is loaded Most Significant 
Bit (MSB) first into the registers (a shift right). 

2.2.6 Video Support Signals 

This section describes the video support signals. 

2.2.6.1 vFrameJ 

Signal Type: 21071-CA I nput 
Signal Source: External logic 
Input Clock Edge: Asynchronous 

Assertion of vFrameJ causes the video display pointer to be loaded with the 
contents of the video frame pointer register which is located in the 21071-CA 
chip. A full serial register load to the video bank is requested at the video 
display pointer address. 

The vFrame_l signal is edge sensitive and asynchronous with the 21071-CA 
chip clocks. Assertion of vFrame_l is detected and synchronized with memClk 
before being used. 

vFrame_l has a weak internal pull-up to support systems that do not use the 
video support functionality provided by the 21071-CA chip. 

2.2.6.2 vRefreshJ 

Signal Type: 21071-CA I nput 
Signal Source: External logic 
Input Clock Edge: Asynchronous 

Assertion of vRefresh_l causes the incremented value of the video display 
pointer to be latched into the video display pointer. A split serial register load 
cycle to the video bank is requested at the video display pointer address. 

The vRefresh_l signal is edge sensitive and asynchronous with the 21071-CA 
chip clocks. Assertion of vRefresh_l is detected and synchronized with memClk 
before being used. 
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VRefresh_l has a weak internal pull-up to support systems that do not use the 
video support functionality provided by the 21071-CA chip. 



2.2.6.3 memDTOE I 



Signal Type: 21071-CA Output 
Signal Destination: Memory 
Output Clock Edge: memClkR 

The memDTOE_l signal has two functions and is intended to be used only by 
the single video bank. During random access reads and writes, memDTOE_l is 
held deasserted before asserting memRAS_l. For random reads, memDTOE_ 
I is asserted with the first column address. During a serial register load, 
memDTOE_l is asserted with the row address. This signal, along with 
memDSF, is used at memRAS_l<8>or memRASB_l<8> assertion by the 
VRAMs to perform full or split register loads. 

2.2.6.4 memDSF 

Signal Type: 21071-CA Output 
Signal Destination: Memory 
Output Clock Edge: memClkR 

The memDSF signal is used at memRAS_l<8> assertion by the single video 
bank to choose between full and split serial register loads. memDSF is driven 
with the row address in order to set up memRAS_l<8>or memRASB_l<8>. 

2.2.7 Miscellaneous/Clock Signals 

This section describes the miscellaneous and clock signals. 

2.2.7.1 wideMem 

Signal Type: 21071-CA Input 
Signal Source: Static 
Input Clock Edge: Static 

The wideMem signal, an input to the 21071-CA and 21071-BA chips, indicates 
the size of the memory data bus. wideMem is tied high to indicate a 128-bit 
wide memory data bus (four 21071-BA chips). wideMem is tied low to indicate 
a 64-bit wide memory data bus (two 21071-BA chips). 

wideMem has a weak internal pull down and a Schmitt trigger input. 
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2.2.7.2 clk1x2 



Signal Type: 21071-CA Input 
Signal Source: Clock Generator 

clklx2 is a clock input which supplies a clock at twice the frequency of the 
DECchip 21064 sysClkOutl signal, with a minimum period of 15 ns and a 50 
percent duty cycle. 



2.2.7.3 clk2ref 



Signal Type: 21071-CA Input 
Signal Source: Clock Generator 

clk2ref is a signal input which is low when the assertion of clklx2 corresponds 
to the assertion of sysClkOutl. The received signal must be set up to the 
assertion of elk 1x2. 



2.2.7.4 reset I 



Signal Type: 21071-CA Input 

Signal Source: External Logic 

Input Clock Edge: Asynchronous on assertion, clklR on deassertion 

Assertion of reset_l sets all internal logic and state machines to their initialized 
states. During reset, the memory data bus is driven, and the sysBus data 
and tag buses are tristated. All signals that are sent to the Alpha 21064 
microprocessor are guaranteed to be tristated or held low, to prevent more than 
3.0 volts from entering the Alpha 21064 microprocessor during reset. 



2.2.7.5 testMode 



Signal Type: 21071-CA I nput 

Signal Source: Test logic 

Input Clock Edge: Asynchronous 

Assertion of testMode places the chip into a mode for chip testing. testMode 
is intended to be used only during chip testing and must be tied low during 
normal system operation. 

testMode has a weak internal pull down and a Schmitt trigger input. 
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2.2.7.6 scanEnable 

Signal Type: 21071-CA Input 
Signal Source: Test logic 

Assertion of scanEnable places all internal flops in their scan state. 
scanEnable is intended to be used only during chip testing and must be tied 
low during normal system operation. 

scanEnable has a weak internal pull down and a Schmitt trigger input. 

2.2.7.7 tristatej 

Signal Type: 21071-CA Input 
Signal Source: External logic 
Input Clock Edge: Asynchronous 

Assertion of this signal tristates all output and bidirectional drivers. tristate_l 
is intended for use only during chip testing and power-up. 

tristate_l has a weak internal pull-up and a Schmitt trigger input. 

2.2.7.8 pTestout 

Signal Type: 21071-CA Output 
Signal Destination: Test logic 
Output Clock Edge: Flow through 

The pTestout signal contains the output from the parametric NAND tree, as 
required for testing. The tristated signal must be asserted for pTestout to be 
valid. pTestout is intended for use only during chip or module testing. 
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2.3 DECchip 21 071-CA Pin Assignment 

TheDECchip 21071-CA is a 208-pin plastic quad flat pacl< (PQFP). Figure 2- 
shows the signal assignments. Sections 2.3.1 and 2.3.2 provide alphabetical 
and numerical pin listings. 
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Figure 2-1 DECchip 21071-CA Pinout Diagram 
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2.3.1 DECchip 21071-CA Alphabetical Pin Assignment List 

Table 2-12 lists the DECchip 21071-CA pins in alphabetical order. The 
following abbreviations are used in theType column of the table: 

• B = Bidirectional 

• I = Input 

• P = Power 

• 0= Output 



Table 2-12 DECchip 21071-CA Alphabetical Pin Assignment List 



Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



clklx2 


135 


1 


clk2ref 


132 


1 


cpuCAck<0> 


180 





cpuCAck<l> 


181 





cpuCAck<^> 


182 





cpuCReq<^> 


184 


1 


cpuCReq<a> 


185 


1 


cpuCReq<2> 


186 


1 


cpuCWMaskO> 


168 


1 


cpuCWMask<L> 


169 


1 


cpuCWMask<2> 


170 


1 


cpuCWMask<3> 


171 


1 


cpuCWMask<4> 


173 


1 


cpuCWMask<S> 


174 


1 


cpuCWMask<6> 


175 


1 


cpuCWMask^> 


176 


1 


cpuDinvReq 


177 





cpuDRackO> 


189 





cpuDRack<L> 


190 





cpuDRack<2> 


191 





cpuDWSel<l> 


187 





cpuHoldAck 


179 


1 


cpuHoldReq 


178 





drvMemData 


64 





drvSysCSR 


74 





drvSysData 


73 






InpVdd 


155 


P 


InpVdd 


207 


P 


InpVdd 


103 


P 


InpVdd 


51 


P 


InpVss 


104 


P 


InpVss 


156 


P 


InpVss 


136 


P 


InpVss 


52 


P 


InpVss 


208 


P 


ioCack<0> 


196 





ioCack<a> 


197 





ioCmd<^> 


199 


1 


ioCmd<^> 


200 


1 


ioCmd-^> 


201 


1 


loDataRdy 


198 





ioGrant 


192 





ioRequestO> 


194 


1 


ioRequest<L> 


195 


1 


memAdrO> 


36 





memAdr<L> 


39 





memAdr-^> 


40 





memAdr<3> 


41 





memAdr<4> 


42 





memAdr'^> 


43 





memAdr<6> 


44 





memAdr<i7> 


45 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



memAdr'^> 


46 





inemAdr-^> 


47 





memAdr<10> 


48 





memAdr<ll> 


49 





memCAS l<0> 


13 





memCAS l<^> 


14 





memCAS \<2> 


15 





memCAS l<3> 


18 





memCmd-^> 


65 





memCmd<a> 


66 





memCmd<3> 


67 





memDSF 


56 





memDTOE 1 


55 





memPDCIk 


58 





memPDDin 


57 


1 


memPDLoad 1 


59 





memRASB \<0> 


3 





memRASB l<L> 


4 





memRASB \<2> 


5 





memRASB l<3> 


6 





memRASB l<4> 


7 





memRASB l-^> 


8 





memRASB l<6> 


9 





memRASB \<7> 


10 





memRASB l<^> 


11 





memRAS l«^> 


23 






memRAS 


I<1> 


24 


memRAS 


l<2> 


25 


memRAS 


l<3> 


28 


memRAS 


l<4> 


29 


memRAS 


l<5> 


30 


memRAS 


l<6> 


31 


memRAS 


\<7> 


32 


memRAS 


l<8> 


33 


memWE 1 


<0> 


34 


memWE 1 


<1> 


35 


nc* 




188 


nc* 




78 


outVdd 




19 


outVdd 




27 


outVdd 




79 


outVdd 




183 


outVdd 




17 


outVdd 




38 


outVdd 




54 


outVdd 




158 


outVdd 




106 


outVdd 




131 


outVdd 




2 


outVss 




1 


outVss 




37 


outVss 




120 



o 
o 
o 
o 
o 
o 
o 
o 
o 
o 



*nc— Do not connect these pins on board. 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



outVss 


16 


P 


outVss 


68 


P 


outVss 


22 


P 


outVss 


50 


P 


outVss 


12 


P 


outVss 


172 


P 


outVss 


105 


P 


outVss 


89 


P 


outVss 


157 


P 


outVss 


141 


P 


outVss 


26 


P 


outVss 


193 


P 


outVss 


53 


P 


pTestout 


203 





reset_l 


206 


1 


scanEnable 


130 


1 


subCmdAO> 


62 





subCmdA<L> 


63 





subCmdB<0> 


60 





subCmdB<l> 


61 





subCmdCommon 


75 





sysAdr<LO> 


112 




sysAdr<ll> 


113 




sysAdr<^2> 


114 




sysAdr<^3> 


115 




sysAdr<a4> 


116 




sysAdr<l5> 


117 




sysAdr<16> 


118 




sysAdr<^7> 


119 




sysAdr<^£> 


121 





sysAdr<19> 


17? 




sysAdr<20> 


123 




sysAdr<21> 


124 




sysAdr<22> 


125 




sysAdr<23> 


126 




sysAdr<24> 


127 




sysAdr<25> 


128 




sysAdr<26> 


129 




sysAdr<27> 


137 




sysAdr<28> 


138 




sysAdr<29> 


139 




sysAdr<30> 


140 




sysAdr<31> 


142 




sysAdr<32> 


143 




sysAdr<33> 


144 




sysAdr<5> 


107 




sysAdr'^> 


108 




sysAdr«i7> 


109 




sysAdr<^> 


110 




sysAdr<^> 


111 




sysCmd<0> 


69 





sysCmd<l> 


70 





sysCmd-^> 


71 





sysDataAH E n 


80 





sysDataAL E n 


77 





sysDataLongWE 


21 





sysDataOEEn 


84 





sysDataWE E n 


20 





sysDataO> 


102 


B 


sysData<l> 


101 


B 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



sysData<^0> 


92 


B 


sysData<^l> 


91 


B 


sysData<^2> 


90 


B 


sysData<^3> 


88 


B 


sysData<a4> 


87 


B 


sysData<a5> 


86 


B 


sysData-^> 


100 


B 


sysData<3> 


99 


B 


sysData<4> 


98 


B 


sysData<2> 


97 


B 


sysData<6> 


96 


B 


sysData<^> 


95 


B 


sysData<^> 


94 


B 


sysData-^> 


93 


B 


sysDOE 


85 





sysEarlyOEEn 


83 





syslORead 


76 





sysReadOW 


72 





sysTagOE E n 


82 





sysTagWE 


81 





tagAdrP 


148 


B 


tagAdr-^7> 


149 


B 



tagAdr<^£> 


150 


tagAdr<^9> 


151 


tagAdr-^0> 


152 


tagAdr<21> 


153 


tagAdr<22> 


154 


tagAdr<23> 


159 


tagAdr<24> 


160 


tagAdr<25> 


161 


tagAdr-^6> 


162 


tagAdr<27> 


163 


tagAdr<28> 


164 


tagAdr<29> 


165 


tagAdr<30> 


166 


tagAdr<31> 


167 


tagCtID 


146 


tagCtIP 


147 


tagCtIV 


145 


testMode 


134 


triState 1 


133 


vFrame 1 


204 


vRefresh 1 


205 


wideMem 


202 
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2.3.2 DECchip 21071-CA Numerical Pin Assignment List 

Table 2-13 lists the DECchip 21071-CA pins in numerical order. The following 
abbreviations are used in the Type column of the table: 

• B = Bidirectional 

• I = Input 

• P = Power 

• 0= Output 



Table 2-13 DECchip 21071-CA Numerical Pin Assignment List 



Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



outVss 


1 


P 


outVdd 


2 


P 


memRASB l«^> 


3 





memRASB l"^> 


4 





memRASB \<2> 


5 





memRASB l<3> 


6 





memRASB l<4> 


7 





memRASB \<5> 


8 





memRASB l<6> 


9 





memRASB I«i7> 


10 





memRASB l<^> 


11 





outVss 


12 


P 


memCAS l<0> 


13 





memCAS I<1> 


14 





memCAS \<2> 


15 





outVss 


16 


P 


outVdd 


17 


P 


memCAS l<3> 


18 





outVdd 


19 





sysDataWEEn 


20 





sysDataLongWE 


21 





outVss 


22 


P 


memRAS \<0> 


23 





memRAS l<L> 


24 





memRAS l<2> 


25 






outVss 


26 


P 


outVdd 


27 


P 


memRAS_l«3> 


28 





memRAS_l<4> 


29 





memRAS_l<5> 


30 





memRAS_l<*> 


31 





memRAS_l<i7> 


32 





memRASJ<^> 


33 





memWE_l<0> 


34 





memWE_l<l> 


35 





memAdr<0> 


36 





outVss 


37 





outVdd 


38 


p 


memAdr-^> 


39 





memAdr-^> 


40 





memAdr<3> 


41 





memAdr<4> 


42 





memAdr'^> 


43 





memAdr'^> 


44 





memAdr-i7> 


45 





memAdr-^> 


46 





memAdr-^> 


47 





memAdr<LO> 


48 





memAdr<^l> 


49 





outVss 


50 


p 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



inpVdd 


51 


P 


inpVss 


52 


P 


outVss 


53 


P 


outVdd 


54 


P 


memDTOE 1 


55 





memDSF 


56 





memPDDIn 


57 





memPDCIk 


58 





memPDLoad 1 


59 





subCmdBO> 


60 





subCmdB<l> 


61 





subCmdAO> 


62 





subCmdA<L> 


63 





drvMemData 


64 





memCmd<a> 


65 





memCmd<3> 


66 





memCmd<3> 


67 





outVss 


68 


P 


sysCmdO> 


69 





sysCmd<l> 


70 





sysCmd-^> 


71 





sysReadOW 


72 





drvSysData 


73 





drvSysCSR 


74 





subCmdCommon 


75 





syslORead 


76 





sysDataALEn 


77 





nc* 


78 


- 



outVdd 


79 


P 


sysDataAHEn 


80 





sysTagWE 


81 





sysTagOEEn 


82 





sysEarlyOEEn 


83 





sysDataOEEn 


84 





sysDOE 


85 





sysData<^5> 


86 


B 


sysData<^4> 


87 


B 


sysData<^3> 


88 


B 


outVss 


89 


P 


sysData<12> 


90 


B 


sysData<ll> 


91 


B 


sysData<10> 


92 


B 


sysData<^> 


93 


B 


sysData-^> 


94 


B 


sysData'^> 


95 


B 


sysData'^> 


96 


B 


sysData'^> 


97 


B 


sysData<4> 


98 


B 


sysData<3> 


99 


B 


sysData-^> 


100 


B 


sysData<l> 


101 


B 


sysData<0> 


102 


B 


inpVdd 


103 


P 


inpVss 


104 


P 


outVss 


105 


P 


outVdd 


106 


P 



*nc — Do not connect these pins on board. 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



sysAdr'^> 


107 1 


sysAdr<6> 


108 1 


sysAdr<i7> 


109 1 


sysAdr-^> 


110 1 


sysAdr<^> 


111 1 


sysAdr<10> 


n? 1 


sysAdr<11> 


113 1 


sysAdr<12> 


114 1 


sysAdr<13> 


115 1 


sysAdr<^4> 


116 1 


sysAdr<L5> 


117 1 


sysAdr<16> 


118 1 


sysAdr<17> 


119 1 


outVss 


120 F 


sysAdr<18> 


121 1 


sysAdr<19> 


12? 1 


sysAdr<^0> 


173 1 


sysAdr<21> 


124 1 


sysAdr<22> 


17S 1 


sysAdr<23> 


126 1 


sysAdr<24> 


127 1 


sysAdr<25> 


128 1 


sysAdr<26> 


129 1 


scanEnable 


130 1 


outVdd 


131 F 


clk2Ref 


132 1 


tri5tate_l 


133 i 


testMode 


134 1 


clklx2 


135 1 


inpVss 


136 F 


5ysAdr-^7> 


137 1 



sysAdr<^8> 


138 


1 


sysAdr<29> 


139 


1 


sysAdr<30> 


140 


1 


outVss 


141 


P 


sysAdr<31> 


142 


1 


sysAdr<32> 


143 


1 


sysAdr<33> 


144 


1 


tagCtIV 


145 


B 


tagCtID 


146 


B 


tagCtIP 


147 


B 


tagAdrP 


148 


B 


tagAdr<a7> 


149 


B 


tagAdr<L8> 


150 


B 


tagAdr-^9> 


151 


B 


tagAdr<20> 


152 


B 


tagAdr-^l> 


153 


B 


tagAdr<22> 


154 


B 


inpVdd 


155 


P 


inpVss 


156 


P 


outVss 


157 


P 


outVdd 


158 


P 


tagAdr<^3> 


159 


B 


tagAdr<^4> 


160 


B 


tagAdr<25> 


161 


B 


tagAdr<26> 


162 


B 


tagAdr<27> 


163 


B 


tagAdr<28> 


164 


B 


tagAdr<29> 


165 


B 


tagAdr<30> 


166 


B 


tagAdr<31> 


167 


B 


cpuCWMaskO> 


168 


1 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



cpuCWMask<l> 


169 


1 


cpuCWMask<^> 


170 


1 


cpuCWMask<3> 


171 


1 


outVss 


172 


P 


cpuCWMask<4> 


173 


1 


cpuCWMask<2> 


174 


1 


cpuCWMask<6> 


175 


1 


cpuCWMask<^> 


176 


1 


cpuDlnvReq 


177 





cpuHoldReq 


178 





cpuHoldAck 


179 


1 


cpuCAckO> 


180 





cpuCAck<L> 


181 





cpuCAck-^> 


182 





outVdd 


183 


p 


cpuCReq<0> 


184 


1 


cpuCReq<l> 


185 


1 


cpuCReq<^> 


186 


1 


cpuDW5el<l> 


187 





nc* 


188 


— 


cpuDRAck<^> 


189 





cpuDRAck<^> 


190 





cpuDRAck<2> 


191 





ioGrant 


192 





outVss 


193 


p 


ioReque5tO> 


194 


1 


ioRequest-^> 


195 


1 



*nc — Do not connect these pins on board. 



ioCAckO> 


196 


ioCAck<l> 


197 


ioDataRdy 


198 


ioCmdO> 


199 


ioCmd<l> 


200 


ioCmd-^> 


201 


wideMem 


202 


pTestout 


203 


vFrame 1 


204 


vRefresh 1 


205 


reset 1 


206 


inpVdd 


207 


inpVss 


208 



o 
o 
o 



2.4 DECchip 21071-CA Mechanical Specifications 

Figure 2-2 shows the DECchip 21071-CA package dimensions. 
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Figure 2-2 DECchip 21071-CA Package Dimensions 



O 



208 PQFP 




DIM 


Millimeters 


Inches | 


MIN 


IVIAX 


MIN 


MAX 


A 


30.50 


30.77 


1.201 


1.211 


B 


27.90 


28.10 


1.098 


1.106 


C 


30.50 


30.77 


1.201 


1.211 


D 


27.90 


28.10 


1.098 


1.106 


G 


0.23 


0.33 


0.009 


0.013 


H 


.500 BSC 


0.0197 BSC 1 


J 


0.45 


0.62 


0.018 


0.024 


K 


3.45 


3.85 


0.136 


0.152 


L 


0.13 


0.23 


0.005 


0.009 


M 


0.25 


0.35 


0.010 


0.012 


R 


25.5 REF 


1.004 REF 


S 


25.5 REF 


1.004 REF 
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3 

DECchip 21071-CA Architecture Overview 



This chapter describes the DECchip 21071-CA architecture. The 21071-CA 
chip provides both second-level cache and memory control functions. The 
21071-CA chip also controls the cache/memory data path located on the 
21071-BA chip. Figure 3-1 shows a block diagram of the 21071-CA chip. 

Figure 3-1 DECchip 21071-CA Block Diagram 
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3.1 sysBus Interface Architecture 

The CPU, 21071-DA chip, 21071-BA chips, cache, and 21071-CA chip 
communicate with each other over the sysBus. The sysBus is essentially 
the processor pinbus with additional signals for DMA transaction control, 
arbitration, and cache control . 

The sysBus interface contains: 

• sysBus arbiter 

• B cache controller 

• Write buffer address and control 

• Read/merge buffer control 

• Lock register 

3.1.1 sysBus Arbitration 

The 21071-CA chip arbitrates between the CPU and 21071-DA chip, which 
request use of the sysBus and the Bcache when they have a transaction to 
perform. The CPU node has default ownership of the sysBus so that it can 
access the Bcache whenever the 21071-DA chip is not requesting the bus. 

3.1.1.1 Arbitration CSRs 

The arbitration policy of the 21071-CA chip can be programmed by setting up 
the DM A_ARB CSR field to select whether the CPU or the 21071-DA chip has 
highest priority. There are three possible priority encodings: 

• CPU priority 

When the CPU and DMA are simultaneously requesting the sysBus, the 
CPU is given the priority. 

• DMA priority 

DMA is given priority over the CPU, and the bus is released to the cache 
on DMA cache misses or noncacheable DMA transactions. 

• DMA strong priority 

DMA is given priority over the CPU. If another ioRequest<l:0> is pending, 
the bus is not released to the cache on DMA cache misses or noncacheable 
DMA transactions. 
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3.1.1.2 DECchip 21071-DA Requests 

The 21071-CA arbiter monitors requests for the sysBus by decoding the 
cpuCReq<2:0>and ioRequest<l:0> fields. cpuCReq<2:0> is not a bus request; 
it is a cycle command that indicates that the CPU has started a transaction on 
the sysBus. 

When the 21071-CA arbiter detects the assertion of ioRequest<L:0> and when 
DMA has won arbitration, it mal<es a request to the CPU for control of the 
Bcache by asserting cpuHoldReq to the CPU. 

The 21071-DA chip can make three types of requests for the sysBus: 

• Atomic Request 

This request is used if the 21071-DA chip wants to do multiple transactions 
without interruption from the CPU. When the 21071-DA chip already 
has a DMA transaction in progress, the assertion of atomic request will 
override programmed priority. If the 21071-DA chip does not already have 
a transaction in progress, the assertion of atomic request is equivalent to 
sending a plain DMA request. 

Note 



To guarantee atomicity, the 21071-DA chip must assert an atomic 
request in the cycle that it drives the command on the ioCmd<2:0> 
lines for the first transaction. 



Preempt Request 

This request should be used by the 21071-DA chip for deadlocl< prevention. 
A preempt request causes the arbiter to request the CPU to suspend a 
transaction in progress. If the 21071-DA chip must do multiple DMA 
transactions for deadlock prevention, it must keep preempt request or 
atomic asserted until all deadlocked transactions have completed. 

When the 21071-DA chip changes ioRequest<l:0>from preempt to idle or 
plain DMA request, the arbiter will allow the suspended CPU transaction 
to resume. 
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A preempt request must be used only on CPU transactions addressed to 
the 21071-DA chip; that is, I/O reads, I/O writes, fetch, fetchM to 21071-DA 
space, and barriers. Preempt must not be asserted when thesysBus is idle 
or on any transactions not addressed to the 21071-DA chip. 

Note 



Because a preempt request suspends the CPU transaction in progress, 
it should be used only if that transaction cannot complete without the 
completion of the requesting DMA transaction. 



• DMA Request 

This is the ordinary DMA request. No special priority is given to DMA 
request unless the arbiter is so programmed. 

3.1.1.3 Arbitration Cycles 

The cycle in which arbitration occurs depends on whether the CPU or the 
21071-DA chip has control of the bus. Arbitration will occur at the following 
times: 

• When a CPU transaction is in progress, arbitration will occur up to two 
cycles before the assertion of cpuCAcl<<2:0>to the CPU. Table 3-1 shows 
the arbitration cycles of CPU transactions. 

If the arbiter receives ioRequest<l:0> at this time, the 21071-DA is granted 
(independent of programmed priority), and cpuHoldReq is asserted to get 
control of the Bcache. 

Table 3-1 Arbitration Cycles of CPU Transactions 

Two Cycles Before cpuCAck One Cycle Before cpuCAck 

CPU read block, CSR or memory CPU read block, I/O space 

CPU write block, CSR or memory CPU write block, I/O space 

CPU fetch, CSR or memory CPU fetch, I/O space 

CPU STx_C hit CPU STx_C fail 

CPU LDx_L hit dirty CPU memory barrier 

— Any error 
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• When a DMA transaction is in progress, arbitration will occur one cycle 
before ioCAcl«l:0> is sent to tine 21071-DA clnip. The result of arbitration 
depends on programmed priority if both the CPU and the 21071-DA chip 
are requesting the bus. 

• When the sysBus is idle, arbitration occurs every cycle. When a sysBus idle 
cycle is followed by requests from both the CPU and the 21071-DA chip, the 
CPU will be granted (independent of programmed priority or the ioRequest 
field). This is because the CPU has already started the transaction on 
the sysBus, and the 21071-DA chip cannot stall write data transfers soon 
enough. 

3.1.1.4 Grant Mechanism 

After the 21071-DA chip has made a request, and the arbiter has determined 
that the 21071-DA chip should be granted the bus, the 21071-CA chip asserts 
ioGrant to the 21071-DA chip and cpuHoldReq to the CPU in the same cycle. 
After cpuHoldReq has been asserted, the 21071-CA and 21071-DA chips must 
ignore cpuCReq<2:0> until the transaction is complete (ioCAck<l:0> has been 
returned) and cpuHoldAck has been deasserted. 

After the 21071-DA chip detects that both ioGrant and cpuHoldAck have been 
asserted, it will drive its command address and data lines as appropriate. 

Note 



The ioCmd<2:0> encodings change as soon as the 21071-DA chip has 
the bus. 



After the 21071-DA chip has received cpuHoldAck, it is expected to take 
away ioRequest <1:0> in the cycle it drives ioCmd<2:0>, unless it has another 
transaction to do. The 21071-DA chip may choose to withdraw ioRequest <1:0> 
without doing a transaction; in this case it should drive IDLE on the 
ioCmd<2:0> pins, until it removes ioRequest <1:0>. If the 21071-DA chip 
withdraws ioRequest <1:0> for one or more cycles after receiving cpuHoldAck, 
performance may be affected, but no other adverse behavior will occur. 

During DMA transactions, the 21071-DA chip will drive the DMA address on 
the sysAdr lines until the 21071-CA chip has completed the Bcache probe and 
latched the DMA address. After the address is latched by the 21071-CA chip, 
the arbiter may decide that it wants to release the cache back to the CPU by 
deasserting cpuHoldReq. 
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When this release 6ec\5\or\ has been made, ioGrant will be deasserted, 
indicating to the 21071-DA chip that it needs to tristate its address lines. The 
arbiter releases the cache on either DMA read or masked write transactions 
that don't use the cache or DMA full write transactions if the memory write 
buffer is full. The release will not occur if the programmed arbitration priority 
is DMA strong and the ioRequest<l:0> lines are non-idle or if the 21071-DA 
chips ioRequest<l:0> lines are driving DMA atomic or DMA preempt. 

3.1.1.5 Releases 

When the cache has been released (during a DMA transaction in progress), 
arbitration occurs one cycle before ioCAcl<<l:0> is sent to the 21071-DA chip. 
It could also occur up to one cycle after ioCacl«l:0>, if ioCacl<<l:0> occurred 
while the sysBus was being released. The result of arbitration depends on 
programmed priority if both the CPU and 21071-DA chip are requesting the 
bus. 

3.1.2 Bcache Control 

The Bcache controller provides control for the secondary cache on CPU- 
initiated memory read/write transactions that miss and on all CPU-initiated 
memory LDx_L and STx_C transactions (hits and misses). On DMA initiated 
transactions, the Bcache controller provides control for probing the cache and 
extracting or invalidating the cache line when required. The 21071-CA chip 
supports only a write-back cache. 

Figure 3-2 shows the implementation of a cache subsystem with a 512 KB 
cache. 
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Figure 3-2 Cache Subsystem for a 512 KB Cache 
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The following sections describe the salient features of the B cache controller. 

3.1.2.1 Bcache Width, Size, and Speed 

The 21071-CA chip supports only a secondary cache width of 128 bits. A 64-bit 
wide cache is not supported. 

The Bcache controller can support Bcache sizes from 128 KB to 16 MB. The 
controller needs to know the Bcache size to perform a Tag compare on the 
appropriate bits. The 21071-CA chip uses a register to enable the appropriate 
bits of the tag address. Software is required to program this register based on 
the size of the cache. Refer to Section 4.2.3 for additional information. 
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The only restriction that the Bcache controller places on the speed of the 
Bcache is that a 21071-CA initiated read from the cache RAMs be completed in 
one sysClk cycle. Bcache writes can be programmed to take one or two sysClk 
cycles. 

3.1.2.2 Bcache Allocation Policy 

The 21071-CA chip supports a write-back Bcache (secondary cache). The 
Bcache is allocated on CPU memory read misses. The 21071-CA chip supports 
an optional allocation policy on writes. Allocation on CPU memory writes can 
be turned off by setting a bit in a register. Refer to Section 4.2.1 for additional 
information. 

3.1.2.3 Bcache Write Granularity 

The Bcache controller in the 21071-CA chip supports octaword write 
granularity to the Bcache. This has implications in the way STx_C hit 
transactions are handled. STx_C transactions are either quadword or longword 
in length. Since less than an octaword cannot be written into the cache, the 
21071-CA chip has to perform a read-modify-write transaction on the Bcache 
when an STx_C hits in the cache. 

On partial writes or STx_C transactions that miss in the cache, the 21071-CA 
chip has to merge the write data with data from memory and write it into the 
cache if allocation is enabled. If allocation is disabled, the write is sent directly 
to memory through the memory write buffer. 

3.1.2.4 CPU-Initiated Bcache Operations 

For CPU requests, the 21071-CA chip performs the following operations on the 
Bcache data and tag RAM s: 

Extracts victim blocks from the Bcache into the write buffer when the 
Bcache has to be allocated. 

Writes the Bcache with fill data in parallel with returning it to the CPU 
during a read block to cacheable memory space. 

Writes the Bcache with the updated block of data during a write block to 
cacheable memory space when write allocate mode is enabled. 

Performs a tag probe and compare for LDx_L and STx_C requests. 

Provides data from the Bcache to the CPU on LDx_L transactions that hit 
in the cache. 

Writes the Bcache tag store with the appropriate address and control bits 
during the previously listed operations. 
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3.1.2.5 DMA-Initiated Bcache Operations 

During DMA requests, after the 21071-CA chip— using the 
cpuHoldReq/cpuHoldAck mechanism— has received ownership of the Bcache, 
the chip performs the following operations: 

• Performs a tag probe to determine if the DMA block is in the Bcache. 

• Reads a block of data from the Bcache and loads it into the DMA read 
buffer if a DMA read hits in the Bcache. 

• Reads a block of data from the Bcache, merges it with DMA write buffer 
data, and loads it into the memory write buffer if a DMA write mask 
transaction hits in the Bcache. 

• Invalidates the cache block if a DMA write hits in the Bcache. 

3.1.2.6 External Logic Requirement 

The 21071-CA chip requires external logic (PALs) to generate the controls for 
the cache RAMs. It supplies cache control signals to external PALs which NOR 
them with the CPU cache control signals. The Bcache PALs dock the system 
cache control signals according to the specific timing requirements of that 
system before NORing with the CPU signals. The 21071-CA chip sends data 
and tag RAM output enables, write enables, and lower address bit signals to 
the Bcache PAL logic. 

3.1.2.7 Tag Compare Logic 

As part of its function to support a system with a backup cache, the 21071-CA 
chip is responsible for comparing the upper bits of sysAdr<33:5> with address 
bits stored in the tag RAMs. The 21071-CA chip does this tag comparison 
during LDx_L and STx_C CPU requests and during DMA transactions to 
cacheable memory space. The number of bits that are used in the address 
comparison and the parity check is controlled by the tag enable register in the 
21071-CA chip. I n the case of a system that implements the smallest cache 
size of 128 KB Bcache, the 21071-CA chip compares sysAdr<31:17>to the 
tagAdr<31:17> bits read from the tag RAMs. I n the other extreme of a 16 M B 
Bcache, the 21071-CA chip performs the comparison only on bits<31:24>. 
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3.1.2.8 CPU Primary Cache Invalidates 

The 21071-CA chip Bcache controller is responsible for ensuring that the 
CPU Dcache is always a subset of the external Bcache. Maintaining system 
cache coherency is accomplished by asserting cpuDI nvReq to the CPU at the 
following times: 

• When a valid Bcache block is replaced during a fill of the Bcache with CPU 
I stream read data. 

• When a valid Bcache block is replaced during a fill of the Bcache with write 
allocate data. 

• During a Bcache invalidate that is due to a DMA write or write masked 
transaction that hits in the cache. 

• During all DMA writes when the Bcache is disabled or when no Bcache is 
present in the system. 

The 21071-CA chip assumes that sysAdr<12:5>are logically connected (either 
directly or indirectly) to the CPU cpul nvAdr<12:5> pins so that the correct 
Dcache block is invalidated. 

3.1.3 sysBus Controller 

The sysBus controller consists of a sequencer that receives CPU and DMA 
command fields for decode, results from the sysBus arbiter logic, and status 
from the memory controller logic. The sequencer supplies state that is used to 
generate Bcache control and read requests to the memory controller. The state 
controls the loading of data from the sysBus into the read/merge buffer and 
write buffer, and it acknowledges cycles to the CPU and 21071-DA chip. 

3.1.3.1 Wrapping 

The sysBus controller supports wrapping on the sysBus. On read transactions, 
the requested octaword is returned to the CPU or the 21071-DA chip. 

Note 

Wrapping is not optional in the sysBus controller. The processor must 
be configured with wrapping enabled. 
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3.1.4 Address Decoding 

The 21071-CA sysBus interface logic decodes tine sysBus address for both CPU 
and DIM A requests in order to determine what action needs to be tal<en. It 
supports cacheable and noncacheable memory accesses, as well as accesses to 
its CSR space. 

Table 3-2 provides an exact mapping of this address space. 



Table 3-2 sysBus Address Map 



sysAdr<33:32> sysAdr<31 :29> Address Space 



Notes 



00 



XXX 



Cacheable memory space 



Accessed by the CPU 
instruction stream 
(I stream) or data 
stream (Dstream). 
Accessed by DMA. 



01 



oxx 



Noncacheable memory space 



01 



01 



01 



100 



101 



IIX 



21071-CA CSRs 



Reserved for 21071-DA 



Reserved for 21071-DA 



Accessed by the CPU 
(I stream/Dstream). 
Accessed by DMA; can 
be used for a frame 
buffer on the DRAM 
bus. 

The 21071-CA chip 
will respond to all 
addresses in this 
space. Dstream access 
only. 

The 21071-CA expects 
the 21071-DA to 
respond to addresses 
in this range. CPU 
Dstream access only. 

The 21071-CA expects 
the 21071-DA to 
respond to addresses 
in this range. CPU 
Dstream access only. 

(continued on next page) 
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Table 3-2 (Cont.) sysBus Address Map 



sysAdr<33:32> sysAdr<31 :29> Address Space Notes 

10 XXX Reserved for 21071-DA The 21071-CA expects 

the 21071-DA to 
respond to addresses 
in this range. CPU 
Dstream access only. 

11 XXX Reserved for 21071-DA The 21071-CA expects 

the 21071-DA to 



respond to addresses 
in this range. CPU 
Dstream access only. 



3.1.4.1 Cacheable Memory Space 

00000 0000.. OFFFF FFFF 

The 21071-CA chip recognizes the 4 GB of quadrant (corresponding to sysBus 
address<33:32> = 00) to be cacheable memory space. The 21071-CA chip 
responds to all read/write accesses in this space. If the Bcache is enabled, 
cache probes, allocates, deallocates, and invalidates happen according to 
the protocols described in Chapter 5. Some or all of main memory can be 
programmed to be in this cacheable space. 

3.1.4.2 Noncacheable Memory Space 

10000 0000.. 17FFF FFFF 

The 21071-CA chip recognizes the lower 2 GB of quadrant 1 (corresponding to 
sysBus address<33:32> = 01) to be noncacheable memory space. The 21071-CA 
chip responds to all read/write accesses in this space. The Bcache is bypassed 
by the 21071-DA chip on accesses to this space. Some or all of main memory 
can be programmed to be in this noncacheable space. If a frame buffer is 
supported in system memory, it should be addressed in this region. 

3.1.4.3 21071-CA CSR Space 

18000 0000.. 19FFF FFFF 

The 21071-CA must respond to all accesses in this space. Exact CSR addresses 
are defined in Chapter 4. 



3-12 DECchip 21071-CA Architecture Overview 



3.1.5 Lock Address Register and Lock Bit 

The 21071-CA chip implements the lock address register and lock bit as 
required by the Alpha architecture. The lock register contains sysAdr<32:5> 
and gets loaded with the sysAdr during all LDx_L transactions. The 21071-CA 
chip locks 32 bytes of data at a time. All LDx_L transactions also set the lock 
bit associated with the address register. 

The following conditions clear the lock bit: 

Chip reset 

A DMA write address matches the lock address 

Any STx_C command 

A CPU write to any I/O address (to allow PALcodeto reset the lock flag) 

The assertion of the loClrLock command from the 21071-DA 

This command is used by the 21071-DA to keep the lock flag clear as long 
as memory is locked by a device on the PCI . 

Note 



The state of the lock bit is unpredictable after STx_C and LDx_L 
transactions that have tag parity or non-existent memory errors. 



3.1.6 Memory Write Buffer 

The 21071-CA chip has a memory writer buffer that supports buffering of up to 
four memory write transactions. This write buffer is used to buffer data on its 
way to memory for the following types of transactions: 

• DMA writes 

• Victim data from the Bcache 

• CPU noncacheable memory write data (which includes all CPU writes 
when allocate mode is disabled) 

The 21071-CA chip stores the cache line address, longword masks, memory 
bankset bank numbers, and a cache line valid bit per entry of the memory 
buffer. 
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3.1.6.1 Write Buffer Address Comparison 

The 21071-CA chip architecture allows memory read requests to bypass writes 
as long as the read address does not match an address in the memory write 
buffer. The 21071-CA chip compares the incoming memory read address 
against the addresses of the valid entries of the memory write buffer. 

If there is a match, then the memory controller will continue to dump the 
contents of the write buffer to memory, one cache line at a time, until the write 
buffer hit condition no longer exists. The memory controller is then free to 
start the original memory read transaction, which resulted from the CPU or 
DMA request. 

If there is no match, then the memory read is allowed to proceed ahead of 
the buffered writes. The memory read transaction may be initiated by a CPU 
or DMA read from memory, a DMA masked write transaction, or a partial 
cacheable write transaction from the CPU . 

3.1.6.2 Write Buffer Flushing 

The 21071-CA chip allows the 21071-DA chip to flush the memory write buffer 
with a special DMA command. 

3.1.6.3 Write Buffer Full Condition 

If the memory write buffer is full, then the 21071-CA chip accepts the first data 
from thesysBus and stores it in a temporary latch until one write transaction 
has been retired to memory. The second data is stalled on the bus until 
then. The write buffer full condition can happen on CPU memory writes 
(noncacheable or nonal locate), DMA writes, and victim reads from the cache. 

3.1 .7 Read/Merge Buffer Control 

The 21071-CA chip controls the read/merge buffer from the 21071-BA chip. 
The read/merge buffer is a cache line buffer which is used for four main 
purposes: 

• Buffering read data from memory until the sysBus is ready to receive it. 

• Supporting Bcache write allocation by providing a mechanism to merge 
CPU partial writes to the cache with the rest of the cache line from 
memory. 

• Supporting STx_C transactions that hit in the cache. 

• Supporting LDx_L transactions that hit in the cache. 
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The read/merge buffer consists of two cache line buffers— the read buffer and 
the merge buffer. The read buffer is used to store memory read data, and the 
merge buffer is used to store write data from the CPU or data read from the 
cache. 

During a CPU read block or DMA read transaction, memory data is loaded 
into the read buffer before being sent out to the sysBus or DMA read buffer. 
The read buffer acts as a timing stage to phase align the memory timing to the 
sysBus timing. After the memory controller has loaded an entry of memory 
data into the read buffer, it sets that entry's valid bit to indicate to the sysBus 
controller logic that data is ready to be returned to the sysBus. During these 
memory read transactions, the buffer is also used for storage, because the 
sysBus could be busy transferring victim data from the cache. 

During a cacheable write block transaction with allocate mode enabled, the 
valid longwords of CPU data are loaded into the merge buffer while the 
memory controller is fetching the rest of the cache line. Because data could 
return from memory before all of the CPU data has been loaded, the read and 
merge buffers can be loaded simultaneously. 

During the special case of an STx_C transaction that hits in the Bcache, the 
merge buffer is used to merge the valid longwords of CPU write data with the 
rest of the cache line read from the Bcache. After the data has been merged in 
the buffer, the entire block is then written back to the Bcache. 

During an LDx_L transaction that hits in the Bcache, the 21071-CA reads the 
data from the cache into the merge buffer, and then drives the requested data 
on the sysBus. 

3.1.8 sysBus Transactions 

This section describes the sysBus transactions. 

3.1.8.1 CPU Transactions 

This section describes the CPU transactions. 

• Read Block From Memory 

A read block from memory can be from cacheable or noncacheable memory 
space. Data is read from memory and returned to the CPU. On a cacheable 
read transaction, a victim, if any, is extracted from the cache, and then the 
cache is filled with the memory data. Only one octaword is transferred on 
noncacheable reads. The Dcache is invalidated on Istream reads. 



DECchlp 21071-CA Architecture Overview 3-15 



Read Block to I/O Space 

A read block from I/O space may be directed to the 21071-CA CSR or to 
the 21071-DA chip. On a read block from the 21071-CA CSR, the data is 
returned by the 21071-CA chip. 

A read block that does not fall within the 21071-CA CSR address range is 
assumed to belong to the 21071-DA chip. 

The 21071-DA chip is expected to receive the command request, take 
appropriate action, and notify the 21071-CA chip when data is ready to be 
returned to the CPU. The 21071-CA chip then provides cpuDRack<2:0> 
and cpuCAck<2:0>to theCPU, and the transaction is terminated. 

Note 



The 21071-DA chip cannot directly respond to the CPU with 
cpuDRack<2:0>and cpuCAck<2:0>. It must respond through the 
21071-CA chip. 



An I/O read addressed to the 21071-DA chip can be preempted by the 
21071-DA chip for deadlock resolution. 

Write Block to Memory 

If allocates are turned on and the transaction is to cacheable space, a 
cache fill is performed at the end of the write. The cache is filled with 
data received from the CPU if the whole cache line is being written. I n the 
case of a partial write, CPU write data is merged with memory data before 
writing in the cache. In either case, a victim (if there is one) is extracted 
before the fill. 

If allocates are turned off, or if the write is noncacheable, the write data 
from the CPU is loaded into the write buffer from where it gets written to 
memory. 

Write Block to I/O Space 

A write block to I/O space may be directed to a 21071-CA CSR or the 
21071-DA chip. On a write block to a 21071-CA CSR, data is written to the 
CSR, and the transaction is completed. 

An I/O write block that does not fall within the 21071-CA CSR address 
range is assumed to belong to the 21071-DA chip. The 21071-DA chip 
is expected to notify the 21071-CA chip when the transaction has to be 
terminated and the 21071-CA chip asserts cpuCAck<2:0>to the CPU. 
An I/O write addressed to the 21071-DA chip can be preempted by it for 
deadlock resolution. 



3-16 DECchip 21071-CA Architecture Overview 



• LDxL 

The Bcache controller performs a cache probe. If the address is a miss, 
then the behavior is exactly the same as that of a memory read block, 
except that the cache line address is stored in the lock register and the lock 
flag is set. The same is true of a noncacheable address. 

If the address is a hit, data is read from the cache into the merge buffer 
and is then returned to the CPU . As in the miss case, the lock address is 
captured, and the lock flag is set. 

An LDx_L to I/O space is handled as a read block to I/O space. 

• STxC 

The 21071-CA chip responds only to STx_C transactions that are addressed 
to memory space or to its CSR space. On an STx_C transaction in memory 
space, the state of the lock flag is checked. If the lock flag is clear, the 
STx_C fails, and the transaction is terminated with an STx_C fail CACK. If 
the lock flag is set, the transaction proceeds as outlined below. 

A cache probe is done to detect a hit or a miss. If it hits in the cache, 
the write data is loaded into the merge buffer, and a read of the cache 
is performed. The read data is merged with the write data and is then 
written to the cache. This is necessary because an STx_C transaction is 
always less than an octaword, and the write granularity of the Bcache is 
an octaword. 

If the cache probe failed, the remainder of the flow looks like a write block. 
As in the write block flow, the write data enters the merge buffer if Bcache 
write allocate is enabled, otherwise it is stored in the memory write buffer. 

An STx_C to the 21071-CA chip CSR space is handled as a write block. 
Error checking takes precedence over checking the lock flag. 

• Barrier 

A barrier transaction has no effect on the 21071-CA chip. However, instead 
of terminating the transaction right away, the 21071-CA chip allows the 
21071-DA chip to respond to a barrier. Therefore, the 21071-DA chip has 
to notify the 21071-CA chip when it wants the barrier terminated. 

Note 



The 21071-CA chip requires the 21071-DA chip to respond to a barrier 
instruction using ioCAck<l:0>. Failure to comply with this condition 
will cause the transaction to hang. 
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• Fetch, FetchM 

A fetch, fetchM transaction has no effect on the 21071-CA chip. If a fetch or 
a fetchIM is within memory or the 21071-CA CSR space, the transaction is 
simply acl<nowledged as OK. The 21071-DA chip must decode and request 
acl<nowledgment of fetch and fetchM transactions if they are within the 
21071-DA chip address space. 

3.1.8.2 DMA Transactions 

After DMA wins arbitration, it may request a transaction with the 21071-CA 
chip. Unlil<e the CPU transactions, the only unit of transfer for DMA 
transactions is a cache line. 

• DMA Read 

A DMA read command is sent by the 21071-DA chip to indicate that 
it wants the lower octaword of the cache line first, followed by the 
upper octaword. The whole cache line is always returned. A DMA read 
transaction to cacheable space causes the Bcache controller to do a cache 
probe. If the address hits in the cache, data is read from the cache and 
returned to the 21071-DA chip. If the address is noncacheable or if the 
address misses in the cache, the data is read from memory. 

• DMA Read Wrapped 

The only difference between a DMA read and DMA read wrapped 
transaction is that the requested data in the DMA read wrapped 
transaction is the upper octaword in the cache line, which should be 
returned first. 

• DMA Read Burst 

The DMA read burst command is similar to the DMA read command. It is 
used by the 21071-DA chip to give a page mode hint to the 21071-CA chip, 
and it may cause the memory controller to remain in page mode at the end 
of this read transaction. 

• DMA Read Wrapped Burst 

The DMA read wrapped burst command is similar to the DMA read 
wrapped command. It is used by the 21071-DA chip to give a page mode 
hint to the 21071-CA chip, and it will cause the memory controller to 
remain in page mode at the end of this read transaction. 
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• DMA Write Full 

This command indicates that the whole cache line has to be written to 
memory. If the address is in cacheable space, the cache is probed. If there 
is a cache hit, the corresponding location is invalidated in the Bcache and 
Dcache. The write data is loaded into the write buffer from where it is 
written to memory. Except for the cache invalidate, the operation is the 
same on noncacheable writes or cache miss writes. 

If the Bcache is disabled (bc_En clear) or not present on the system, every 
DMA write will cause a CPU data cache (Dcache) invalidate. 

• DMA Write Masked 

The 21071-DA chip requests a DMA write masked transaction when only 
a subset of the bytes in a cache line are to be written. The 21071-CA chip 
begins the transaction by performing a DMA read. As the read data is 
received from the Bcache or memory, it is merged with DMA write data 
and loaded into the memory write buffer. If the cache was hit, the cache is 
invalidated. 

If the Bcache is disabled (bc_En clear) or not present on the system, every 
DMA write will cause a CPU Dcache invalidate. 

• DMA Flush 

This command should be used by the 21071-DA chip when it wants to 
flush the memory write buffer. The 21071-CA chip will acknowledge the 
transaction after all buffered writes have been written to memory. 

3.1.9 Error Handling 

During CPU and DMA transactions, the 21071-CA chip detects the following 
errors: 

• Bcache tag address parity error 

• Bcache tag control parity error 

• Non-existent memory error 

When one or more errors are detected on a transaction, the 21071-CA 
chip signals the errors to the CPU or the 21071-DA chip at the end of the 
transaction by acknowledging hard error on the cpuCAck<2:0>or ioCAck<l:0> 
field. The current sysAdr<33:5> is logged in the error address register and 
error status is logged in the error and diagnostics status register. These CSRs 
are locked until the CPU clears all the error status bits by writing the CSR. 
Refer to Chapter 4 for additional information. 



DECchip 21071-CA Architecture Overview 3-19 



If errors cxcur on a transaction while the error address and status are locl<ed, 
the transaction is acknowledged with hard error on the cpuCAck<2:0> or 
ioCAck<l:0> command fields. The LostErr bit in the error and diagnostics 
status Register is set, and neither the error address nor the error status of the 
lost error are recorded. 

The hard error indication overrides STx_C fail. The lock bit is unpredictable 
after LDx_L transactions that have errors. 

3.2 Memory Controller 

This section describes memory organization and memory controller features. 

3.2.1 DRAM and SIMM Requirements 

The I/O pins for all the SIMMs or RAMs must beTTL compatible. DRAM 
output drivers are controlled by using only the memCAS_l and memWE_l 
pins. The VRAM drivers use memDTOE_l and memDSF pins in addition to 
the memCAS_l and memWE_l pins. TheOE_l pins on the DRAMs should be 
grounded. A separate CAS per longword must be used at the RAMs. CAS- 
before-RAS refresh must be supported. The expected RAS-access time is 50 ns 
to 100 ns, with page mode CAS-access time between 10 ns and 50 ns. 

3.2.2 Memory Organization 

The 21071-CA chip supports between 8MB and 4 GB of dynamic random- 
access memory (DRAM) and an additional 1 MB to 8 MB of dual port random- 
access memory (VRAM ). 

Memory can be accessed in two widths— 64 bits and 128 bits. The actual 
number of bits required is higher depending on the mode of error detection. 
Longword parity requires 66 or 132 bits, and longword ECC requires 78 or 156 
bits corresponding to 64-bit and 128-bit wide memory respectively. 

The 21071-CA chip supports up to 8 banksets of DRAM and 1 bankset of 
VRAM. Each bankset can be made up of one or two banks. A bank of memory 
refers to one width of DRAMs. It may be implemented using SIMMs or by 
directly soldering DRAMs on the module. A SIMM implementation requires 
more than one SI M M to form one memory bank. For instance, four 33-, 36-, 
or 40-bit SIMMs would be required to form a bank width of 128. The two 
banks in a bankset should be identical in configuration, size, and speed. The 
21071-CA chip has a pair of RAS signals that corresponding to a bankset— 
memRAS_l and memRASB_l. Each bank in a bankset should be connected to 
one of these RAS pins. If the bankset has only one bank of RAMs, memRASJ 
should be used, and memRASB I should be left unconnected. 
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Figure 3-3 shows the memory set organization. 



Figure 3-3 Memory Set Organization 



1 ) Each Bankset has a pair of RASes, memRAS_l<8:0> and memRASB_l<8:0> 

2) With 64-bit memory, only memCAS_l<1:0> are used. 
With 128-bit memory, memCAS_l<3:0> are used. 

3) memAdr and memWEL are shared by all sets and subsets. 
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3.2.2.1 Memory Bankset Characteristics 

Each memory bankset must conform to the following characteristics: 

• Width: All the banksets in a system must have the same memory width. 

• Banks: The banks in a bankset should be identical in DRAM size and 
speed. 

• Longword writes: Each bankset must support longword write capability. 
The 21071-CA chip generates longword CASes for writes. For banksets 
implemented using 33-, 36- or 40-bit SIMMs, each SIMM should receives 
unique memCASJ pin. Table 3-3 shows the CAS connections. 

• Address Range: Each bankset has a programmable base address and 
size. The base address of a bankset must be aligned to the natural size 
boundary. For example, an 8 MB bankset must start on an 8 MB boundary. 
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Table 3-3 Longword Number to memCAS_l[n] Correspondence 



Memory 
Width 



<0> 



memCASJ 
<1> <2> 



<3> 



64 



128 



LWO 


LWl 


Unused 


Unused 


LW2 


LW3 


Unused 


Unused 


LW4 


LW5 


Unused 


Unused 


LW6 


LW7 


Unused 


Unused 


LWO 


LWl 


LW2 


LW3 


LW4 


LW5 


LW6 


LW7 



A detailed description of the banksets is given in the following sections. 

3.2.2.2 BanksetO..Bankset7 

BanksetO through bankset? are intended for DRAMs; they have the same 
features. 

• DRAM Type: IM x 1, IM x 4, 4M x 1, 4M x 4 and 16M x 1 DRAMs are 
supported. Both symmetrical (11,11) and asymmetrical (12,10) addressing 
for 16 MB DRAMs are supported. Typical expected RAS access time is 
50 to 100 ns. CAS-before-RAS refresh is used to refresh all banksets 
simultaneously. 

• Bankset Size (MB): A bankset may be made up of 1 or 2 banks, giving a 
total of IM, 2M, 4M, 8M, 16M or 32M addressable locations depending on 
the depth of the DRAMs used. Each location consists of 8 bytes for 64-bit 
memory or 16 bytes for 128-bit memory. Table 3-4 lists supported bankset 
sizes and the possible DRAM configurations that can be used to get these 
sizes. 

Table 3-4 Supported Bankset Sizes and DRAM Configurations for Different 
Memory Widthis 



Locations 
in Banl<set 



Banl<set Size 
64-Bit 128-Bit 



Number of 
Subbanks 



DRAM 
Configurations 



IM 


8 MB 


16 MB 


1 


IM xl/lM x4 


2IV| 


16 MB 


32 MB 


2 


IM x 1 / IM x 4 


4IVI 


32 MB 


64 MB 


1 


4M X 1 / 4M X 4 


8IVI 


64 MB 


128MB 


2 


4M X 1 / 4M X 4 


16IVI 


128 MB 


256MB 


1 


16M xl 


32M 


256 MB 


512 MB 


2 


16M xl 



3-22 DECchip 21071-CA Architecture Overview 



3.2.2.3 BanksetS 

A single, fixed bankset location for VRAMs simplifies the support logic and 
reduces CSR bits. As banksetS provides from 1 M B to 8 M B of VRAM , more 
than one VRAM bankset is not required. 

• VRAM Type: 128K x 4, 128K x 8, 256K x 4, and 256K x 8 VRAMs are 
supported. The number of rows in the VRAM must be 512. This is required 
for the video display pointer logic to increment correctly. Typical expected 
RAS-access-time to the RAM port of the VRAM is 50 ns to 100 ns. CAS- 
before-RAS refresh is used. 

• Bankset8 Size: Bankset8 can have 1 or 2 banks giving a total of 128K, 
256K or 512K addressable locations. This provides 1 M B, 2 M B, or 4 M B of 
VRAM for 64-bit memory; and 2 M B, 4 M B, or 8 M B for 128-bit memory 

3.2.2.4 Supported Memory SIMMs 

The 21071-CA chip supports industry-standard 33-, 36-, and 40-bit SIMMs. 33- 
and 36-bit SIMMs are used when longword parity is the error detection mode, 
and 40-bit SIMMs are used when longword ECC is used. Table 3-4 lists the 
DRAM sizes and widths that are supported. Split RAS SIMMs are supported 
by the 21071-CA chip. Split RAS SIMMs have two banks of RAMs, one on 
each side. A split RAS SIMM can therefore be considered as a bankset with 
two banks, and the corresponding memRAS_l and memRASB_l can be used to 
select between either side of the SIMM. 

3.2.3 Memory Address Generation 

Note 



The programmable base address of a bankset must be aligned to the 
natural size boundary. For example, an 8 MB bankset must start on an 
8MB boundary. The hardware allows for holes in memory with badly 
programmed addresses. 



This section describes the generation of row and column addresses from the 
address originating on the sysBus, that is, the physical address PA<33:5>. The 
21071-CA chip sysBus interface decodes accesses to memory space and the 
21071-CA chip I/O space. The physical addresses received by the 21071-CA 
chip memory control logic are always in memory space. 

For memory reads, the address comes directly from sysAdr<33:5>. For memory 
writes, the write buffer provides the initial value of PA<33:4>. For video serial 
register loads, the address is derived internally. 
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Each bankset has a programmable base address and size. The incoming 
physical address is compared in parallel with the memory ranges of all 
banksets present. Depending on the size of the bankset, a variable number of 
PA and base address bits from the CSR are compared. Table 3-5 describes the 
base address bits and the subbank bit for the allowed bankset sizes. 



Table 3-5 Base Address Comparison 



Bankset Size Compared 



Subbanl< 



512 MB 

256MB 

128MB 

64MB 

32 MB 

16MB 

8MB 

4MB 

2 MB 

1 MB 



PA<33:29> 


PA<28> 


PA<33:28> 


PA<27> 


PA<33:27> 


PA^6> 


PA<33:26> 


PA^5> 


PA<33:25> 


PA^4> 


PA<33:24> 


PA<23> 


PA<33:23> 


PA^2> 


PA<33:22> 


PA<21> 


PA<33:21> 


PA^O> 


PA<33:20> 


PA<19> 



Note 



BanksetO through bankset? have a minimum size of SMB. VRAM 
banksets has a maximum size of S MB. 



The memory address depends on the width of memory and the number of row 
and column bits per bankset. Program Sn_ColSel according to Table 3-6 and 
Table 3-7. 



Table 3-6 Row and Column Address Decode for BanksetO..? 



Sn ColSel 



Memory 
Width 



Row,Column 
Bits 



Row Address 



Column Address <11:0> 



000 


64 


12,12 


PA<24:13> 


PA<26,25,12:3> 


001 


64 


12,10 or 
11,11 


PA<24:13> 


PA«,24,12:3> 


on 


64 


10,10 


PA<xx,22:13> 


PAo<x,12:3> 


000 


128 


12,12 


PA<25,24,22:13> 


PA<27,26,23,12:4> 


001 


128 


12,10 or 
11,11 


PA<25,24,22:13> 


PAo<,25,23,12:4> 


on 


128 


10,10 


PA<xx,22:13> 


PA<3<x,23,12:4> 



3-24 DECchip 21071-CA Architecture Overview 



Table 3-7 Row and Column Address Decode for BanksetS 



S8_ColSel 


Memory 
Width 


Row,ld Bits 


Row 
Address<11:0> 


Column Address<11:0> 


100 


64 


9,9 


xxx,<20:12> 


xxx,<Ll:3> 


101 


64 


9,8 


xxx,<19:ll> 


xxxx,<L0:3> 


100 


128 


9,9 


xxx,<21:13> 


xxx,<L2:4> 


101 


128 


9,8 


xxx,<20:12> 


xxxx,<Ll:4> 








Nntfi 





BankSetO through bankset? cannot have less than 10 column bits as 
the smallest DRAM size supported is IM x 1. BanksetS cannot have 
more than 9 column bits as the largest VRAM supported is 256K x 8. 



3.2.4 Performance Optimizations 

The following sections describe performance optimizations. 

3.2.4.1 Memory Page Mode Support 

The 21071-CA chip supports page mode within CPU read transactions. Page 
mode between transactions is supported on DMA read burst transactions and 
on memory write transactions. 

The following page mode features are supported by the 21071-CA chip: 

• A refresh transaction never starts in page mode. If any memRAS_l is 
asserted when the refresh transaction is selected, the controller waits for 
the duration of the RAS precharge before doing the refresh. 

• A video transaction never starts in page mode. If memRAS_l<8>or 
memRASB_l<8>are asserted when the video transaction is selected, 

the controller waits for the duration of the RAS precharge before doing the 
transaction. 

• A memory read transaction will start in page mode if the preceding 
transaction was a memory read initiated by a DMA read burst command 
on the sysBus, and the row address, bankset, and subbank of the current 
transaction are the same as that of the previous transaction. Furthermore, 
a memory read initiated by a CPU transaction (read or partial write) will 
never start in page mode. This is because the sysB us controller notifies 
the memory controller to deassert RAS if the sysBus has been given to the 
CPU after a DMA read burst. 
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• A memory write transaction starts in page mode, only if the previous 
transaction was a write, and tine row address, bankset, and subbanl< of the 
current write are the same as tinat of tine previous transaction. 

I n all of the previous cases, the transaction will not start in page mode if the 
maximum RAS width counter has overflowed. The RAS has to be precharged 
even if there is a page hit. 

A transaction that does not start in page mode may or may not have the extra 
latency of RAS precharge. If the current transaction is to a different banl<set 
than the previous one, the RAS for the previous transaction is deasserted, and 
at the same time, the RAS for the current one is asserted. 

3.2.4.2 Read Latency Minimization 

I n order to minimize the read latency seen by devices on the sysBus, the 
memory controller performs certain optimizations in the way transactions are 
selected. In general, because writes can go into a deep write buffer, reads 
are given priority over writes, to the extent that in some cases the memory 
controller waits for a read to happen even if there are writes queued up in the 
write buffer. These situations are described here: 

Following a memory read initiated by a CPU or DMA transaction on the 
sysBus (CPU read or a partial write), the 21071-CA chip does not service a 
write from the write buffer for 12 memClk cycles after the last read data has 
latched, unless the write buffer is full. The reason for doing this is that there 
is a delay between the completion of the read by the memory controller and 
the initiation of another read on the sysBus. Servicing a write from the write 
buffer would add latency to the following read. This will definitely happen on 
reads that have Bcache victims, because every read will be accompanied by a 
write. The write will add latency to the next read, and the effect of the victim 
buffer will be minimal. This condition is called Wait After Read (WAR). Waiting 
after a DMA read also helps in the case of a scatter/gather read performed in 
guaranteed-access-time (GAT) mode. See Section 9.4.3 for more details about 
GAT mode. 

The writes are held off only if the write buffer is not full. 

3.2.5 Transaction Scheduler 

The memory interface does memory refresh, cache line reads, cache line writes 
and shift register loads to VRAM banksetS. The memory controller has a 
scheduler that prioritizes transactions and selects one of them to be serviced. 
If the selected transaction is waiting for RAS precharge, and in the meantime 
another higher priority transaction comes along, the scheduler deselects the 
previously chosen transaction and selects the higher priority one. 
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Table 3-8 describes the priority scheme. 



Table 3-8 Memory Transaction Scheduling 



Refresh Read 


WBuf 


Write 


WBuf 


Video 








Request Request 


Hit^ 


Request 


Full^ 


Request 


RB" 


WAR^ 


Select 


1 Xi 


X 


X 


X 


X 


X 


X 


Refresh 


X 


X 


X 


X 


1 


X 


X 


Video 


1 





X 


X 





X 


X 


Read 


1 


1 


X 


X 





X 


X 


Write 





X 


X 


1 








X 


Write 





X 


1 








X 





Write 





X 


X 


X 





1 


X 


None 





X 


X 








X 


1 


None 



^X : Don't care 

^WBuf Hit: Read address matches buffered write. 

^WBuf Full: Write buffer full. 

"•RB: Read burst. Hint to stay in read page mode. 

^WAR: Wait after read. Internal stall signal. 



3.2.6 Programmable Memory Timing 

The memory control state machine sequences through all the memory 
transactions. On memory read and write transactions, it has to communicate 
with the 21071-BA chips so that data may be latched from the memData bus or 
driven onto the memData bus respectively. All memory signals are generated 
on memClkR. However commands from the 21071-CA chip to the 21071-BA 
chip are sent on sysClocks (clk2R). Because the sysClock cycle time is twice 
that of the memClk, the 21071-BA chips have to be informed which memClk 
the data has to be latched on. This is done by sending immediate and delayed 
commands. Immediate commands require that data is latched (or driven) on 
the next memClk rising edge, and the delayed commands require that data be 
latched (or driven) on the second memClkR. 

The memory control state machine is actually made up of two separate state 
machines — one is the master, which does all the RAS and CAS assertion, and 
controls when the other state machines start; the second is the read/write state 
machine, which does all the sequencing for generating the memCmds to read 
or write memory data. The read/write state machine is started by the master, 
and then it sequences independently. Each state machine uses some of the 
programmed timing parameters to generate the corresponding memory control 
signals. 
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Note 

While programming the memory timing, ensure that the parameters 
used for the address, RAS, and CAS are compatible with the ones used 
for data; otherwise, operation on the memory interface will be incorrect. 



Because memCmds have to be sent to the 21071-BA chips on clk2R, the 
memory controller synchronizes the start of all transactions to clk2R. This 
way, the memory control signals track the memory data according to the 
programmed values. This synchronization may add an extra delay of one 
memClk on memory transactions. When the memory controller is idle, sysBus 
reads or writes do not have the extra delay, because the corresponding requests 
are generated synchronous to sysClock. 

3.2.7 Presence Detect Logic 

The 21071-CA chip supports loading the status of 32 presence pins into a 
register after reset. The 32 bits are loaded into a shift register on the module 
and then shifted one bit at a time into the 21071-CA chip. 

As soon as the internal synchronized version of reset deasserts, the loading 
process begins. First, the data is loaded into the shift register by asserting 
memPDLoad_l and pulsing memPDCIk. Then a bit is loaded by toggling 
memPDLoad_l. Either edge of memPDCIk may be used to shift memPDDIn, as 
memPDDI n is sampled when memPDCIk is stable. Once all 32 bits have been 
loaded, memPDCIk stops and the presence detect registers may be read. See 
Figure 3-4, which shows the operation of the presence detect logic. 
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Figure 3-4 Presence Detect Logic Operation 
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Table 3-9 shows the presence detect shift registers that are supported. 



Table 3-9 Supported Presence Detect Shift Registers 



Part 


Bits^ 


/Load 2 


clk3 


Din" 


Dout^ 


Vcc6 


Gnd^ 


74F 166 


8 


/PE 


CP 


DS 


07 


— 


/CE 


74F194 


4 


*S1 


CP 


DSR 


03 


/MR,SO 


— 


74F195 


4 


/PE 


CP 


J,/K 


03 


/MR 


— 


74F199 


8 


/PE 


CP 


J,/K 


07 


/MR 


/CE 


74F299 


8 


*S1 


CP 


DSR 


00 


/MR,/OE 


— 


74F322 


8 


S/P 


CP 


DO 


07 


/MR,/SE 


/RE,S 


74F323 


8 


*S1 


CP 


DSR 


00 


/SR 


/OE 


74F395 


4 


*PE 


/CP 


DS 


OS 


/MR 


/OE 


74F674 


16 


R/W 


/CP 


NotSup 


015 


M 


/CS 


74F676 


16 


*M 


/CP 


SI 


so 


— 


/CS 



■■^N umber of presence detect pins supported. 

^Pins to tie to nnennPDLoad_l. Asterisl< (*) indicates tliat signals must be inverted on module. 

^Pins to tie to memPDCIk. 

''Pins to daisy chain data into. 

^Pins to daisy chain to next shift register or to memPDDI n. 

^Pins to be tied high. 

^Pins to be tied low. 



3.2.8 Video Support Logic 

The 21071-CA chip provides the logic and control to perform full and split 
serial register loads to the VRAM banl<set8. The 21071-CA chip does regular 
CPU/DMA accesses to the random port of banl<set8 if the address matches 
the banl<'s base address, just lil<e for any other banl<set. I n addition, the 
21071-CA chip does serial register loads in response to vframej or vRefresh_l 
pin assertions. When the 21071-CA chip does a serial register load, the VRAM 
latches the data in the accessed row into its serial register. Other external 
logic then shifts out the serial register through the VRAM's serial port. The 
21071-CA chip does not provide any support for unloading the serial port of 
the VRAM. Figure 3-5 shows an implementation of a video subsystem using a 
dumb frame buffer in banksetS. 

In a full serial register load, the entire RAM row specified by the row address 
is latched into the serial register. In a split serial register load, only half the 
row is latched into the serial register. The MSB of the column address specifies 
whether the upper or lower half of the row will be latched. 
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In terms of timing, a serial register load is identical to a memory read 
to banksetS, with the exception of memDTOE and memDSF. The data on 
memData<31:0> is ignored during serial register loads. 

The 21071-CA chip provides the logic and control to perform full and split 
serial register loads to the VRAM banksetS. The Video Frame Pointer (VFP) 
CSR provides the start address of the video frame buffer in memory. An 
internal set of latches, called the Video Display Pointer (VDP), contains the 
subset, row, and column addresses for video shift register loads. 

Following a vFrame_l assertion, the Video Frame Pointer is latched into the 
VDP. A full serial register load is performed at the subbank and row address 
indicated in the VDP, with an all-zero column address. At the end of the load, 
the row address in the VDP is incremented (mod 512) to point to the next 
row. I n case of overflow, the subbank bit in the VDP is toggled if subbanks are 
enabled for banksetS. The column MSB in the VDP is toggled. 

Following a vRefresh_l assertion, a split serial register load is performed at the 
subbank and row address indicated in the VDP. The column MSB in the VDP 
is toggled. If the new column MSB equals 0, the row address in the VDP is 
incremented. If the row address overflows (mod 512), the subbank bit in the 
VDP is toggled if the subbank is enabled. 

The memory controller can take up to 135 sysClk cycles to complete a serial 
register load after the assertion of vFrame_l or vRefreshJ. If a request is 
reasserted before the previous request has been completed, the second request 
may either override the first request or it may be ignored. 

Simultaneous assertion of vFrame_l and vRefreshJ can cause one of the 
requests to be serviced whi le the other is lost. 

Figure 3-5 shows a video subsystem using a DECchip 21071 chipset and a 
dumb frame buffer. 
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Figure 3-5 Video Subsystem Using a DECcliip 21071 Chipset and a Dumb Frame 
Buffer 
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This chapter describes the 21071-CA control and status registers (CSRs). It 
also provides information about how to program memory timing, configure 
memory, and initialize the Bcache. 

4.1 Register Descriptions 

This section describes the 21071-CA control and status registers (CSRs). 
These CSRs are 16 bits wide and addressed on cache-line boundaries only. 
Writes to read-only registers could result in unpredictable behavior. Reads are 
nondestructive. Only zeros should be written to unspecified bits within a CSR. 
Only bits <15:0>of each CSR are defined. Other bits are undefined. CSRs are 
initialized as specified in the register descriptions. 

Table 4-1 shows the base address and name of all the control and status 
registers. 

Table 4-1 DECchip 21071-CA Register Summary 



Address 



Name 



1 8000 0000 


General control register 


1 8000 0020 


Reserved 


1 8000 0040 


Error and diagnostic status register 


1 8000 0060 


Tag enable register 


1 8000 0080 


Error low address register 


1 8000 OOAO 


Error high address register 


1 8000 OOCO 


LDx L low address register 


18000 00E0 


LDx_L high address register 



(continued on next page) 
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Table 4-1 (Cont.) DECchip 21071-CA Register Summary 



Address 



Name 



1 8000 0200 
1 8000 0220 
1 8000 0240 
1 8000 0260 
1 8000 0280 



Global timing register 
Refresh timing register 
Video frame pointer register 
Presence detect low data register 
Presence detect high data register 



8000 
8000 
8000 
8000 
8000 
8000 
8000 
8000 
8000 



0800 
0820 
0840 
0860 
0880 
08A0 
08C0 
08E0 
0900 



Bank 
Bank 
Bank 
Bank 
Bank 
Bank 
Bank 
Bank 
Bank 



base 

1 base 

2 base 

3 base 

4 base 

5 base 

6 base 

7 base 

8 base 



address 
address 
address 
address 
address 
address 
address 
address 
address 



register 
register 
register 
register 
register 
register 
register 
register 
register 



8000 
8000 
8000 
8000 
8000 
8000 
8000 
8000 
8000 



OAOO 
0A20 
0A40 
0A60 
0A80 
OAAO 
OACO 
OAEO 
OB 00 



Bank 
Bank 
Bank 
Bank 
Bank 
Bank 
Bank 
Bank 
Bank 



configi 

1 configi 

2 configi 

3 configi 

4 configi 

5 configi 

6 configi 

7 configi 

8 configi 



u rati on 
u rati on 
u rati on 
u rati on 
u rati on 
u rati on 
u rati on 
u rati on 
u rati on 



register 
register 
register 
register 
register 
register 
register 
register 
register 



8000 
8000 
8000 
8000 
8000 
8000 
8000 
8000 
8000 



OCOO 
0C20 
0C40 
0C60 
0C80 
OCAO 
OCCO 
OCEO 
ODOO 



Bank 
Bank 
Bank 
Bank 
Bank 
Bank 
Bank 
Bank 
Bank 



timing 

1 timing 

2 timing 

3 timing 

4 timing 

5 timing 

6 timing 

7 timing 

8 timing 



register A 
register A 
register A 
register A 
register A 
register A 
register A 
register A 
register A 



1 8000 OE 00 
1 8000 OE 20 



Bank timing register B 
Bank 1 timing register B 



(continued on next page) 
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Table 4-1 (Cont.) DECchip 21071-CA Register Summary 



Address 



Name 



8000 
8000 
8000 
8000 
8000 
8000 
8000 



0E40 
0E60 
0E80 
OEAO 
OECO 
OEEO 
OF 00 



Bank 2 timing register B 
Banl< 3 timing register B 
Banl< 4 timing register B 
Banl< 5 timing register B 
Banl< 6 timing register B 
Banl< 7 timing register B 
Banl< 8 timing register B 



4.2 General Registers 

This section describes the 21071-CA general registers. These registers control 
the sysBus state machine and associated logic. 

4.2.1 General Control Register 

The general control register contains status information that affects the major 
operational modes of the entire 21071-CA chip. Figure 4-1 shows the register 
bit assignments, and Table 4-2 provides the bit descriptions for the general 
control register. 

Figure 4-1 General Control Register 



15 



09 08 07 06 05 04 03 02 01 



1 8000 0000 

Reserved 
sysArb 
Reserved 
- wideMem 
bc_EN 
bc_NoAlloc 
bc_LongWr 
bc_lgnTag 
bc_FrcTag 
bc_FrcD 
bc_FrcV 
bc_FrcP 
bc_BadAP 
Reserved 
Reserved 
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Table 4-2 General Control Register 



Field 



Type, 
Bits Reset 



Description 



Reserved 
sysArb 



<0> MBZ 

<2:1> RW,0 



DMA arbitration mode. Determines arbi- 
tration sciiemefor sysBus transactions. 



Value Meaning 



Reserved 
wideMem 



be En 



be NoAlloc 



bc_LongWr 



bc_l gnTag 



<3> 

<A> 



MBZ 
RO,- 



<5> RW,0 



<6> RW,0 



<7> RW,0 



<8> RW,0 



OX 
10 
11 



CPU priority 
DMA priority 
DMA strong priority 



See Section 3.1.1 for a detailed description 
of til ese fields. 



Memory size. Reads tlie status of tlie 
wideMem input pin. Returns 1 if tlie 
memory is 128 bits wide, or if 64 bits 
wide. 

Bcaclie enable. When clear, the Bcache is 
disabled and the cache state machine will 
not probe the cache. 

Bcache no allocate mode. When set, CPU 
writes to cacheable memory space will not 
be allocated into the cache. 

Bcache long writes. When set, two sysBus 
cycles are required to write to the cache 
data RAMs. See Section 5.1.4. 

Bcache ignore tag. When set, Bcache 
probes will act as if the valid bit was 
invalid. All tag results will be ignored, and 
any victims will be lost. Tag and address 
parity will be ignored. May be used to fill 
the cache with valid data. 



(continued on next page) 
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Table 4-2 (Cont.) General Control Register 



Field 



Bits 



Type, 
Reset 



Description 



bcF rcTag 



<9> 



RW,0 



bc_FrcD 


<10> 


RW,0 


bc_FrcV 


<11> 


RW,0 


bc_FrcP 


<12> 


RW,0 


be BadAP 


<13> 


RW,0 



Reserved 



<15:14> MBZ 



Bcache force tag. When set, the Bcache 
will be probed for victims, and the line 
will be invalidated using the values in the 
bc_FrcD, bc_FrcV, and bc_FrcP. CSRs will 
be used as the tag controls. Although the 
line is invalidated (assuming bc_FrcV 
is reset), the data is loaded into the 
cache and will be returned to the CPU 
as cacheable. Used for diagnostic testing 
of the cache RAM, and for flushing the 
cache by setting this bit, clearing bc_FrcV, 
and cycling through the address range 
present in the cache. 

Bcache force dirty. When set, the dirty bit 
will beset on the next cache fill. 

Bcache force valid. When set, the valid bit 
will beset on the next cache fill. 

Bcache force parity. When set, the parity 
bit will beset on the next cache fill. 

Bcache force bad address Parity. When 
set, the tag address parity will be loaded 
as bad. This bit is independent of the 
bc_FrcTag bit. 



4.2.2 Error and Diagnostic Status Register 

The error and diagnostic status register contains status information for 
diagnostics and for error analysis. The occurrence of an error sets one or 
more error bits (bc_TAPErr, bc_TCPErr, nxM Err) and locks the address of the 
error. After the address is locked, any additional error will set lostErr and will 
not affect the address or other error bits (bc_TAPErr, bc_TCPErr, nxMErr). 
Clearing all of the error bits (not the lostErr bit) unlocks the address. 

Figure 4-2 shows the register bit assignments, and Table 4-3 provides the bit 
descriptions for the error and diagnostic status register. 
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Figure 4-2 Error and Diagnostic Status Register 
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1 8000 0040 

lostErr 

bc_TAPErr 

bc_TCPErr 

nxMErr 

dmaCause 

vicCause 

cReqCause 

Reserved 

pass2 

IdxILock 

wrPend 



Table 4-3 Error and Diagnostic Status Register 



Field 



Bits 



Type, 
Reset 



Description 



lostErr <0> RWIC.O Multiple errors. When set, indicates that 

additional errors occurred when an error 
address was already locked. No address or 
cause information is latched for the error. 
Cleared by writing a 1 to lostErr. 

bc_TAPErr <1> RWlC.O Bcache tag address parity error. When set, 

indicates that a tag probe encountered bad 
parity in the tag address RAM. Set only 
when address is unlocked. 

bc_TCPErr <2> RWlC.O Bcache tag control parity error. When set, 

indicates that a tag probe encountered bad 
parity in the tag control RAM. Set only 
when address is unlocked. 

nxMErr <3> RW1C,0 Nonexistent memory error. When set, 

indicates that a read or write occurred to 
an invalid address that does not map to 
any memory bank, CSR, or I/O quadrant. 
Set only when address is unlocked. 

(continued on next page) 
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Table 4-3 (Cont.) Error and Diagnostic Status Register 



Field 



Bits 



Type, 
Reset 



Description 



dmaCause 



<A> 



RO, 



vicCause 



<5> 



RO, 



cReqCause 



<8:6> 



RO, 



Reserved 
pass2 

IdxILcxk 



wrPend 



<12:9> 
<13> 

<14> 



<15> 



MBZ 
R0,1 

RO,- 



RO,0 



DMA transaction caused error. Wlien 
set, indicates tliat tlie bc_TAPErr, 
bc_TCPErr, or nxMErr was caused by 
a DMA transaction. Locl<ed witli tlie error 
address. Only valid when an error is 
indicated on bc_TAPErr, bc_TCPErr, or 
memErr. 

Victim write caused error. When set, 
indicates that an NXM error was caused 
by a victim write transaction. Undefined 
for other types of errors. Locked with the 
error address. Only valid when an error 
is indicated on bc_TAPErr, bc_TCPErr, or 
memErr. 

Cycle request that caused error. I ndicates 
the DMA or CPU cycle request type that 
caused the error. Copy of either the 
cpuCReq or ioCmd lines depending on 
the DmaCause CSR. Locked with the 
error address. Only valid when a error 
is indicated on bc_TAPErr, bc_TCPErr, or 
memErr. 



Chip version reads low on passl and high 
on pass2. 

LDx_L locked. When set, indicates that 
the lock bit for LDx_L is set, and that the 
next STx_C may succeed. Writing to any 
CSR or I/O space location clears this lock 
bit. 

Write pending. When set, indicates that 
valid write data is stored in the write 
buffer. 
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4.2.3 Tag Enable Register 

The tag enable register is a read/write register. This register indicates which 
bits of the cache tag are to be compared with sysAdr<33:5>. If a bit is 1, the 
corresponding bits in sysAdr<33:5>and tagAdr<31:17> are compared. If a 
bit is 0, there is no comparison for those bits, and the tagAdr bit is assumed 
to be tied low on the module (through a resistor). Bits <15:l>in the register 
represent tagAdr<31:17>. This register is not initialized. 

There is no requirement that the upper bits of tagEn be set. An 
implementation that does not allow the full 4 GB of cacheable memory to 
be installed may mask off upper bits of TagEn, and save having to store a bit 
in the tag address in the tag address RAM. 

To construct the tagEn bits, see Tables 4-4 and 4-5. The value shown in 
Table 4-4 (based on the cache size) is ANDed with the value in Table 4-5 
(based on the maximum cacheable system memory). 

The following example shows how to program a system with a 16 M B cache 
and a maximum of 1 GB of cacheable memory: 

nil nil 0000 OOOX (ANOed with (16 MB, Table 4-4) ) 
0011 nil nil nix (gives (1 GB, Table 4-5)) 

0011 nil 0000 OOOX (value is put into tag enable register) 

Figure 4-3 shows the register bit assignments for the tag enable register. 
Table 4-4 provides the cache size tag enable values, and Table 4-5 provides 
the maximum memory tag enable values. 

Figure 4-3 Tag Enable Register 
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Table 4-4 Cache Size Tag Enable Values 



tagEn<15:0> 



Compared Cache Size 



0000 


0000 


0000 ooox 


None 


4 GB 


1000 


0000 


0000 ooox 


<31:31> 


2 GB 


1100 


0000 


0000 ooox 


<31:30> 


1GB 


1110 


0000 


0000 ooox 


<31:29> 


512 MB 


1111 


0000 


0000 ooox 


<31:28> 


256MB 


1111 


1000 


0000 ooox 


<31:27> 


128MB 


1111 


1100 


0000 ooox 


<31:26> 


64MB 


1111 


1110 


0000 ooox 


<31:25> 


32 MB 


1111 


1111 


0000 ooox 


<31:24> 


16MB 


1111 


1111 


1000 ooox 


<31:23> 


8MB 


1111 


1111 


1100 ooox 


<31:22> 


4MB 


1111 


1111 


1110 ooox 


<31:21> 


2 MB 


1111 


1111 


1111 ooox 


<31:20> 


1MB 


1111 


1111 


1111 lOOX 


<31:19> 


512KB 


1111 


1111 


1111 llOX 


<31:18> 


256KB 


1111 


1111 


1111 lllX 


<31:17> 


128KB 



Table 4-5 Maximum Memory Tag Enable Values 



tagEn<15:0> 



Compared Memory Size 



1111 


1111 


1111 lllX 


<31:17> 


4 GB 


0111 


1111 


1111 lllX 


<30:17> 


2 GB 


0011 


1111 


1111 lllX 


<29:17> 


1GB 


0001 


1111 


1111 lllX 


<28:17> 


512MB 


0000 


1111 


1111 lllX 


<27:17> 


256MB 


0000 


0111 


1111 lllX 


<26:17> 


128MB 


0000 


0011 


1111 lllX 


<25:17> 


64MB 


0000 


0001 


1111 lllX 


<24:17> 


32MB 


0000 


0000 


1111 lllX 


^3:17> 


16MB 


0000 


0000 


0111 lllX 


<22:17> 


8MB 


0000 


0000 


0011 lllX 


<21:17> 


4MB 


0000 


0000 


0001 lllX 


<20:17> 


2 MB 


0000 


0000 


0000 lllX 


<19:17> 


1MB 


0000 


0000 


0000 OllX 


<18:17> 


512KB 


0000 


0000 


0000 OOIX 


<17:17> 


256KB 


0000 


0000 


0000 ooox 


None 


128KB 
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4.2.4 Error Low Address Register 

The error low address register locks the low order bits of the sysBus address 
that caused the error that set the bc_TAPErr, bc_TCPErr, or nxMErr bit in 
the error and diagnostic status register. If a victim read caused the error, 
then the victim address is not latched; rather, the address of the transaction is 
latched. Bits <15:0> represent sysAdr<20:5>. This register is read-only. It is 
not initialized and is only valid when an error is indicated. 

Figure 4-4 shows the register bit assignments for the error low address 
register. 

Figure 4-4 Error Low Address Register 
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4.2.5 Error High Address Register 

The error high address register locl<s the high order bits of the sysBus address. 
Bits <12:0> represent sysAdr<33:21>. This register is read-only. It is not 
initialized and is only valid when an error is indicated. 

Figure 4-5 shows the register bit assignments for the error high address 
register. 

Figure 4-5 Error High Address Register 
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Reserved 



4.2.6 LDx_L Low Address Register 

The LDx_L low address register stores the low order bits of the last locl<ed 
address. Bits <15:0>in the register represent sysAdr<20:5>. This register is 
read-only, and it is not initialized. 
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Figure 4-6 shows the register bit assignments for the LDx_L low address 
register. 

Figure 4-6 LDx_L Low Address Register 
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4.2.7 LDx_L High Address Register 

The LDx_L high address register stores the high order bits of the locl<ed 
address. Bits <12:0>in the register represent sysAdr<33:21>. This register is 
read-only, and it is not initialized. 

Figure 4-7 shows the register bit assignments for the LDx_L high address 
register. 

Figure 4-7 LDxL High Address Register 
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4.3 Memory Registers 

The following registers on the 21071-CA chip control memory configuration and 
timing. Each bankset of memory has one configuration register and two timing 
registers. The global timing register and refresh timing register apply to all 
banksets. The video frame pointer is used for video transactions to banksetS. 
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4.3.1 Video Frame Pointer Register 

The video frame pointer register contains address information tinat points to 
the beginning of the video frame buffer. The video frame pointer is loaded 
into the video display pointer at the beginning of each full serial transfer to 
banksetS. This register is not initialized. 

Figure 4-8 shows the register bit assignments, and Table 4-6 provides the bit 
descriptions for the video frame pointer register. 

Figure 4-8 Video Frame Pointer Register 
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Table 4-6 Video Frame Pointer Register 



Field 



Type, 
Bits Reset 



Description 



vfp_Col<4:0> 



vfp_Row<8:0> 



<4:0> RW,- 



<13:5> RW,- 



Video frame column address pointer. 
vfp_Col<4:0>are used as column 
address <6:2>for all serial register 
loads. 

Video frame row address pointer. 
Row address of the start of the 
frame buffer. 

(continued on next page) 
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Table 4- 


-6 (Cont.) 


Video Frame Pointer Reg 


ster 


Field 




Bits 


Type, 
Reset 


Description 



vfp_SubBank 



<14> 



RW,- 



Reserved 



<15> 



MBZ 



Video frame subbank pointer. 
Subbank for tlie start of tlie 
frame buffer. If tlie subbank is 
enabled by setting s8_SubEna 
in tine banksetS configuration 
register, setting tlie vfp_SubBank 
bit causes tlie 21071-CA cliip to 
assert memRASBJ<8> instead of 
memRASJ<8>on full serial register 
loads. vfp_SubBank is ignored if 
s8 SubEna is cleared. 



4.3.2 Presence Detect Low Data Register 

The presence detect low data register stores the low order bits of the presence 
detect information that was shifted in after reset. Bits <15:0>in the register 
represent data bits <15:0>that were shifted in. 

Note 



After deassertion of reset, it takes 148 system clock cycles for presence 
detect data to become valid. 



Figure 4-9 shows the register bit assignments for the presence detect low data 
register. 



Figure 4-9 Presence Detect Low Data Register 
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4.3.3 Presence Detect High Data Register 

The presence detect high data register stores the high order bits of the 
presence detect information that was shifted in after reset. Bits •<15:0> in the 
register represent data bits <31:16>that were shifted in. 

Note 



After deassertion of reset, it tal<es 148 system clocl< cycles for presence 
detect data to become valid. 

Figure 4-10 shows the register bit assignments for the presence detect high 
data register. 

Figure 4-10 Presence Detect High Data Register 
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4.3.4 Base Address Registers 

Each memory banl<set has a corresponding base address register. The bits in 
this register are compared with the incoming sysAdr to determine the banl<set 
being addressed. The contents of this register are validated by setting the 
valid bit in the configuration register of that banl<set. 

The base address of each bankset must begin on a naturally aligned boundary. 
(So, for a banl<set with 2" addresses, the n least significant bits must be zero.) 



Note 



Software could require contiguous memory. Because banl<sets must 
be naturally aligned, the programmer should ensure that the largest 
banl<set is placed at the lowest base address, the next largest banl<set 
is placed at a base address following the end of the largest bankset, 
and so on, to create contiguous memory. 



BanksetS must be placed on an aligned 8MB boundary for bank sizes less 
than or equal to 8 MB. 
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If banksetS has parity or ECC checking disabled (s8_Check bit clear), then 
banksetS must be mapped into noncacheable space (S8_BaseAdr<32>set). 

Figure 4-11 shows the register bit assignments for the banksetO base address 
register. 

Figure 4-11 BanksetO Base Address Register 
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4.3.5 Configuration Registers 

Each memory bankset has a corresponding configuration register. This register 
contains mode bits and bits for memory address generation, as well as bankset 
decoding. Banksets through 7 have the same limits on bankset size and 
type of DRAMs used. The format of the configuration register is the same 
for banksets through 7. BanksetS is the VRAM bank. It supports different 
minimum DRAM sizes and configurations; therefore, its configuration register 
is different. 

With the exception of the valid bit, this register is not initialized. 

Figure 4-12 shows the register bit assignments, and Table 4-7 provides the bit 
descriptions for the bankset through 7 configuration registers. 

Figure 4-12 Bankset Configuration Register 
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Table 4-7 BanksetO Configuration Register 



Field 



Bits 



Type, 
Reset 



Description 



sO_Valicl <0> RW,0 BanksetO valid. If set, all timing and 

configuration parameters for banksetO are 
valid, and access to banksetO is allowed. If 
cleared, access to banksetO is not allowed. 

sO_Size<3:0> <4:1> RW,- BanksetO size in MB. Indicates tlie size of 

tlie bankset in order to determine wliicii 
bits are used to compare tlie bankset base 
address witli the pliysical address (PA) 
and to generate the subset. Corresponds 
to the total size of the bankset, including 
subbanks, if present. sO_Size<3>must be 
set to 0. 



SO 








Size<3:0> 


Compared 


Subset 


Set Size 


0000 


— 


— 


Reserved 


0001 


PA<33:29> 


PA<28> 


512 MB 


0010 


PA<33:28> 


PA<27> 


256MB 


0011 


PA<33:27> 


PA<26> 


128MB 


0100 


PA^3:26> 


PA<25> 


64MB 


0101 


PA<33:25> 


PA<24> 


32MB 


Olio 


PA<33:24> 


PA<23> 


16MB 


0111 


PA<33:23> 


PA<22> 


8MB 


IXXX 


— 


— 


Reserved 



sO SubEna 



<5> RW,0 Enable subbanks. When set, subbanks are 

enabled and determined according to the 
previous table. When clear, subbanks are 
disabled, and the memRASBJ pins will be 
asserted only during refreshes. 

(continued on next page) 
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Table 4-7 (Cont.) BanksetO Configuration Register 



Type, 
Field Bits Reset Description 



sO_ColSel<2:0> <8:6> RW,- Column address selection. Indicates the 

number of valid column bits expected at 



the DRAMs. Used along with memory 
width information to generate row or 
column addresses. Memory width is 
determined by the wideMem pin. See 
Table 3-6 for more information. 



S0_ColSel<2:0> 


Row,Column Bits 


000 


12,12 


001 


12,10 or 11,11 


010 


Reserved 


on 


10,10 


IXX 


Reserved 



Reserved <15:9> MBZ 
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F igure 4-13 shows the register bit assignments, and Table 4-8 provides the bit 
descriptions for the banl<set8 configuration register. 



Figure 4-13 BanksetS Configuration Register 
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Table 4-8 Bankset 8 Configuration Register 



Field 



Bits 



Type, 
Reset 



Description 



s8_Valid <0> RW,0 Valid. If set, all parameters are valid, and 

access to banksetS is allowed. If cleared, 
no accesses to banksetS are allowed. 

s8_Size<3:0> <4:1> RW,0 Size. Indicates the size of the bankset in 

order to determine which bits are used 
to compare the base address with the 
physical address and to select the subset 
(if s8_SubEna is set). Corresponds to the 
total size of banksetS, including subbanks, 
if present. 



s8 






Bankset 


Size<3:0> 


Compared 


Subbank 


Size 


oxxx 


— 





Reserved 


1000 


— 


— 


Reserved 


1001 


PA<33:23> 


PA<22> 


8MB 


1010 


PA<33:22> 


PA<21> 


4MB 


1011 


PA<33:21> 


PA<20> 


2 MB 


1100 


PA<33:20> 


PA<L9> 


1 MB 


1101 


— 


— 


Reserved 


1110 


— 


— 


Reserved 


1111 


— 


— 


Reserved 



(continued on next page) 
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Table 4-8 (Cont.) Bankset 8 Configuration Register 



Field 



Type, 
Bits Reset 



Description 



s8 SubEna 



s8 ColSel<2:0> 



<5> 



RW,0 



<8:6> RW,- 



Enable subbanks. When set, subbanks are 
enabled and determined according totlie 
previous table. When clear, subbanks are 
disabled, and the memRASBJ pins will 
only be asserted during refresh. 

Column address selection. I ndicates the 
number of valid column bits expected at 
the DRAMs. Used along with memory 
width information to generate column 
row or column addresses. Memory width 
is determined by the wideMem pin. See 
Table 3-7 for more information. 



S8 ColSel 



Row, Column Bits 



oxx 


Reserved 


100 


9, 9 


101 


9, 8 


IIX 


Reserved 



s8 Check 



<9> 



RW,0 



Reserved 



<15:10>|V1BZ 



Enable ECC/parity checking. When set, 
accesses to bankset8, like other banksets, 
will have their parity or ECC checked. 
When clear, parity or ECC will not be 
checked. When clear, bankset8 must be 
mapped into noncacheable space. Only 
bankset8 has this feature. 

DMA accesses to this bank should not be 
performed when error checking is disabled. 



4.3.6 Bankset Timing Registers A and B 

Each bankset has two timing registers associated with it. These registers 
contain the timing parameters required to perform memory read and write 
transactions. The format of the timing registers is identical for all 9 banksets. 

On reset, all the parameters are set to the maximum value. This may cause 
improper operation of the memory interface. The timing registers should be 
programmed by software before setting the corresponding bankset valid bit in 
the configuration register. 
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All the timing parameters are in multiples of memClk cycles. Most of the 
timing parameters in timing registers A and B have a minimum value that is 
added to the programmed value. The programmer should be careful to subtract 
this value from the desired value before programming it into the register. The 
parameter descriptions in this section also indicate the corresponding DRAM 
parameter. 

See Section 4.4 to determine how the timing register should be programmed 
for particular memory transactions. 

Figure 4-14 shows the register bit assignments, and Table 4-9 provides the bit 
descriptions for the bankset timing register A. 

Figure 4-14 Bankset Timing Register A 
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Table 4-9 BankSet Timing Register A 



Field 



Type, 
Bits Reset 



Description 



s8_RowSetup<l:0> <1:0> RW.ls 



s8 RowHold<l:0> <3:2> RW.ls 



Row address setup (tASR). Used to 

generate memRAS_l assertion from row 

address. 

Programmed_Value = Desired_Value -1 

Row address hold (tRAn)- Used to 
switch memAdr from row to column 
after memRAS_l assertion. 
Programmed_Value = Desired_Value -1 

(continued on next page) 
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Table 4- 


-9 (Cont.) 


BankSet Timing Register A 


Field 




Bits 


Type, 

Reset Description 



s8_ColSetup<2:0> <6:4> RW.ls 



s8 ColHold<l:0> <8:7> RW.ls 



s8_RDIyRow<2:0> <11:9> RW.ls 



s8_RDIyCol<2:0> <14:12> RW.ls 



Column address setup (Iasc) to first 
CAS assertion and write enable setup 
(tcwL) to CAS assertion. Used to 
determine first memCASJ assertion 
after column address and memCAS_l 
assertion after memWE_l. The 
maximum of the two setup values 
should be programmed. A programmed 
value of 7 is illegal. 
Programmed_Value = Desired_Value -1 

Column hold (tcAn) from memCASJ 
assertion. Used to determine when the 
current column address can be changed 
to the next column or row address. 
Programmed_Value = Desired_Value -1 

Read delay from row address. Delay 
from row address to latching first valid 
read data. 
Programmed_Value = Desired_Value -4 

Read delay from column address. Used 
only when starting in page mode. 
Delay from column address to latching 
first valid read data. 
Programmed_Value = Desired_Value -2 



Reserved 



<15> MBZ 
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Figure 4-15 shows the register bit assignments, and Table 4-10 provides the 
bit descriptions for the banl<set timing register B. 

Figure 4-15 Bankset Timing Register B 
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Table 4-10 Bankset Timing Register B 



Field 



Type, 
Bits Reset 



Description 



s8 RTCas<2:0> 



<2:0> RW.ls 



s8 WTCas<2:0> <5:3> RW.ls 



s8 TCP<1:0> 



<7:6> RW.ls 



S8 WHoldORow<2:0> <10:8> RW.ls 



s8 WHoldOCol<2:0> <13:11> RW,1 



Reserved 



<15:14> MBZ 



Read CAS width (tcAs)- Used on reads 
to generate tlie memCAS_l deassertion 
from tlie assertion of memCAS_l. Note: 
RTCas and TCP sliould be programmed 
so tliat tlieir sum is < 5. 
Programmed_Value = Desired_Value -2 

Write CAS widtli (tcAs)- Used on writes 
to generate tlie memCAS_l deassertion 
from tiie assertion of memCAS_l. 
Note: WTCas and TCP sliould be 
programmed so tliat their sum is < 5. 
Programmed_Value = Desired_Value -2 

CAS precharge (tcp). Delay from 
memCAS_l deassertion to the next 
assertion of memCAS_l in page mode. 
Programmed_Value = Desired_Value -1 

Write hold time from row address. Hold 
time of first write data from first row 
address. The first write data is valid 
with the row address and is held valid 
s8_WHoldORow + 2 cycles after the 
row address. Used when not starting 
in page mode. A programmed value of 
zero is illegal. 
Programmed_Value = Desired_Value -2 

Write hold time from column address 
is used only for the first data when 
starting in page mode. Write data is 
valid with the column address and is 
held valid S8_WHoldOCol +2 cycles 
after the column address. 
Programmed_Value = Desired_Value -2 



DECchip 21071-CA Programmer's Reference 4-23 



4.3.7 Global Timing Register 

The global timing register contains parameters that are common to all memory 
banksets. Each parameter counts memClk cycles. All pins on the memory 
interface are referenced to memClk rising. 



Note 



The 21071-CA chip requires the RAS precharge interval to be smaller 
than the length of a complete memory transaction. For values of 
gtr_RP less than or equal to 4 (which gives a 6 memClk precharge), 
there are no restrictions. For RAS precharge greater than 6 memClk 
cycles, each valid bankset must satisfy the following conditions: 

gtr_RP < RowHold +ColSetup +WTCas +4 
gtr_RP < RowHold +ColSetup +RTCas +4 



Figure 4-16 shows the register bit assignments, and Table 4-11 provides the 
bit descriptions for the global timing register. 



Figure 4-16 Global Timing Register 
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Table 4-11 Global Timing Register 



Field 



Bits 



Type, 
Reset 



Description 



gtr_RP<2:0> 



<2:0> RW.ls 



gtr_ivi ax_Ras_Width<2:0> 



<5:3> RW.ls 



Reserved 



<15:6> MBZ 



Minimum number of 
RAS precharge cycles. 
memRASJ deassertion 
to next assertion of the 
same memRASJ pin. 
Corresponds to DRAM 
parameter tRp. 
Programmed_Value = 
Desired_Value -2 

Maximum RAS assertion 
widtli as a multiple of 128 
memClk cycles. When 
this count is reached, the 
asserted memRASJ is 
deasserted at the end of 
the ongoing transaction. 
This value should be 
programmed with sufficient 
margin to allow for the 
timer overflowing during a 
transaction. Corresponds to 
DRAM parameter tRAs- 

When programmed to a 
0, page mode between 
transactions will be 
disabled. 



4.3.8 Refresh Timing Register 

The refresh timing register contains refresh timing information used to 
simultaneously refresh all banksets using CAS-before-RAS refresh. Therefore, 
these parameters should be programmed to the most conservative value across 
all banksets. 

The observed refresh interval may be greater than the value programmed in 
ref_interval by the number of memClk cycles required to perform a read or 
write plus a RAS precharge interval. The programmer must account for this 
behavior when choosing the value of ref_interval. 
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All the timing parameters are in multiples of memClk cycles. The parameters 
have a minimum value that is added to the programmed value. The 
programmer should be careful to subtract this value from the desired value 
before programming it to the register. 

Figure 4-17 shows the register bit assignments, and Table 4-12 provides the 
bit descriptions for the refresh timing register. 

Figure 4-17 Refresh Timing Register 
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Table 4-12 Refresh Timing Register 






Type, 
Field Bits Reset 


Description 


di; 


5Ref 




<0> 


RV 


10 


Disable refresh. Refresh 
operations will not be performed 
when disRef is set. 



ref Cas2Ras<2:0> 



ref RasWidth<2:0> 



<3:1> 



RW.ls 



<6:4> RW.ls 



Refresh CAS assertion to RAS 
assertion cycles. Corresponds 
to DRAM parameter tcsR. 
Programmed_Value = Desired_ 
Value -2 

Refresh RAS assertion 
width, from memRASJ 
assertion to memRASJ 
deassertion. memCAS_l is 
deasserted with memRAS_l 
for refresh. Corresponds 
to DRAM parameter Iras. 
Programmed_Value = Desired_ 
Value -3 

(continued on next page) 
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Table 4-12 (Cont.) Refresh Timing Register 



Field 



Bits 



Type, 
Reset 



Description 



ref lnterval<5:0> 



Reserved 
force Ref 



<12:7> 



<14:13> 
<15> 



RW, 000001 



MBZ 



WO,- 



Refresh interval. Multiplied 
by 64 to generate number of 
memClk cycles between refresh 
requests. A programmed value 
of zero is illegal. 



Force refresh. Writing a 1 to 
this bit causes a single memory 
refresh. Reads as 0. Resets the 
internal refresh interval counter. 

The other timings in this register 
should not be changed while 
setting this bit. Force refresh 
overrides disable refresh. 



DECchip 21071-CA Programmer's Reference 4-27 



4.4 Programming Memory Timing 

This section describes Inow a system designer sinould program the memory 
timings for a particular memory configuration, DRAIM speed, and sysCII< cycle 
time. The system designer should: 

1. Develop a timing diagram for memory reads, writes, refreshes, page mode 
reads, and page mode writes for the chosen memory configuration and 
sysClk cycle time. 

2. Count the number of cycles required for a particular parameter. This 
is the desired value that is referred to in the description of the various 
parameters. For each parameter there is an equation to generate the 
programmed value from the desired value (generally by subtracting a 
constant from the desired value). 

Warning 



The memData driving and latching state machines run independently 
from the state machine that controls memRas_l, memCas_l, memAdr, 
and the other controls. The two machines start at the same time, and 
then use the programmed timing to cycle through the transaction. 
Arbitrarily programming RDIyRow, RDIyCol, WHoldORow, and 
WHoldOCol could result in illegal memory transactions. 
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Tables 4-13 and 4-14 provide equations tinat must be applied while 
programming the memory timings. 

Table 4-13 Read Timings: Equations for Programmed Values 

RDIyROW =RowSetUp +RowHold +ColSetUp +Taccess^ -1 
RDIyCol =ColSetUp +Taccess -1 
RTCas > Taccess -2 
RTCas +TCP < 5 

■■^Taccess is the access time in nnemCII<s for data from CAS assertions, determined by module signal 
integrity and DRAM timing. 



Table 4-14 Write Timings: Equations for Programmed Values 

WHoldORow =RowSetUp +RowHold +ColSetUp +TDataHold^ +1 
WHoldOCol =ColSetUp +TDataHold -1 
WTCas>TDataHold-2 
WTCas +TCP < 5 

■■^TDataHold is the data hold time, in memClk cycles from CAS assertions, determined by module 
signal integrity and DRAM timing. 
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Figures 4-18 and 4-19 show the timing for a memory write and memory read, 
respectively. Assume that the two timing diagrams shown are for the same 
bankset. The programming for these transactions is shown in Table 4-15. 

Table 4-15 Programming Memory Timings 



Parameter 


Desired 
Value 


Programmed 
Value 


Timing Diagram 


RowSetU p 


2 




Read, Write 


RowHoid 


2 




Read, Write 


ColSetUp 


2 




Read, Write 


Col Hold 


2 




Read, Write 


RTCas 


3 




Read 


TCP 


1 





Read, Write 


RDIyRow 


9 


5 


Read 


WTCas 


2 





Write 


WHoldORow 


8 


6 


Write 


gtr_RP 


4 


2 


Read, Write 
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Figure 4-18 Memory Write Timing 
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Figure 4-19 Memory Read Timing 
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4.5 Configuring Memory 

The 21071-CA memory configuration and timing registers must be set up 
before memory can be read and written by the CPU. Firmware must determine 
the number of memory banksets in the system and the speed and size of the 
memory SIMMs used. 

The 21071-CA provides two methods for determining memory configuration. 

4.5.1 Using the 21071-CA Presence Detect Registers to Configure 
l\/lemory 

The system designer could use the presence detect registers in the 21071-CA to 
load in the value of the presence detect pins of the memory SIMMs following 
the deassertion of reset. See Section 3.2.7 for the details of this operation. 
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4.5.2 Polling Memory to Configure Memory 

This method can be used if the presence detect pins are not accessible via the 
21071-CA presence detect registers. The following algorithm can be used by 
the firmware to determi ne memory configuration. 

1. Configure all banksets invalid by writing to the 21071-CA bankset 
configuration registers. 

2. Read the general control register to determine whether memory is 128 bits 
wide or 64 bits wide. 

The procedure for determining the configuration is the same in both cases, 
except that the sizes in MB mentioned in the following steps should be 
halved for 64-bit wide memory. Start with banksetO. 

3. Configure bankset as valid with a base address =0, bankset size =512 MB, 
ColSel =000 (12,12 DRAMs), and subEna =1 (subbanks enabled). 

4. Configure the bankset timing registers for slow memory. For example. 
Timing Register A =4F99#16 and Timing Register B = 17D2#16. 

5. Write 11111111*16 to address 0. 

6. Write 22222222*16 to address 10*16. 

7. Read address 0; if the data is not 11111111*16, bankset has no memory. 
Store this information; configure the bankset as invalid. Go back to step 3 
and start with the next bankset. If the data read is 11111111*16, bankset 
has memory; go to the next step. 

8. Write 33333333*16 to address 128 M B . 

9. Write 44444444*16 to address 0. 

10. Read address 128 MB. If the data returned is 44444444*16, the bankset 
has wrapped back to address 0; the bankset under investigation is not a 
12,12 bankset. Go to step 11. If the data is not 44444444*16, the bankset 
is a 12,12 bankset. Determine whether it has subbanks: 

• Write address 256 M B with 55555555*16; this attempts to write the 
upper subbank. 

• Write address with 66666666*16. 

• Read address 256 M B. If the data is 55555555*16, the subbank exists; 
if not, this bankset does not have subbanks. 

• At this point, all the information for this bankset is known. Store this 
information and configure the bankset as invalid. Go to step 3 and 
start with the next bankset. 
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11. Write lllllllim<o to address 16 M B. 

12. Write 88888888*16 to address 0. 

13. Read address 16 IMB. If data returned is 88888888*16, the bankset is not 
a 12,10 or 11,11 bankset; tine bankset under investigation is not a 12,10 or 
11,11 bankset. Go to step 15. If tlie data is not 88888888*16, tine bankset 
is a 12,10 or 11,11 bankset. Determine whetlner it has subbanks: 

Configure the bankset size to 128 MB with subEna =1 (subbanks 
enabled). TheColSel and base address should remain unchanged. 

Write address 64 MB with 99999999*16; this attempts to write the 
upper subbank. 

Write address with AAAAAAAA*16. 

Read address 64 MB. If the data is 99999999*16, the subbank exists; if 
not, this bankset does not have subbanks. 

At this point, all the information for this bankset is known. Store this 
information and configure the bankset as invalid. Go to step 3 and 
start with the next bankset. 

14. Write BBBBBBBB*16 to address 8 MB. 

15. Write CCCCCCCC*16 to address 0. 

16. Read address 8MB. If the data returned is CCCCCCCC*16, the bankset 
is not a 10,10 bankset. An illegal bankset has been inserted. If the data 
returned is not CCCCCCCC*16, the bankset is a 10,10 bankset. Determine 
whether it has subbanks: 

• Configure bankset size to 32 M B with subE na = 1 (subbanks enabled). 
The ColSel and base address should remain unchanged. 

• Write address 16 M B with DDDDDDDD*16; this attempts to write the 
upper subbank. 

• WriteaddressO with EEEEEEEE*16. 
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• Read address 16 MB. If the data is DDDDDDDD#16, the subbank 
exists; if not, this bankset does not have subbanks. 

• At this point, all the information for this bankset is known. Store this 
information and configure the bankset as invalid. Go to step 3 and 
start with the next bankset. 

17. When the configurations of all the banksets are known, set up the base 
addresses of each bankset. The largest bankset should be mapped to the 
lowest base address. 

4.6 Bcache Initialization 

Firmware has to initialize the Bcache and memory before booting the operating 
system. The following sections describe the two methods used to initialize the 
Bcache and memory. 

4.6.1 Primary Method to Initialize the Bcache 

1. Disable the Bcache— B I U_CTL<bc_En>=0 and 21071-CA GCR<d3C_En>=0, 
GCR<bc_lgnTag>=0. 

2. Disable machine checks ABOX_CTL<JV| CH K_E N >=0. 

3. Write something to all locations throughout the available memory. This 
will put good data parity/ECC in the memory SIMMs. 

4. Enable Bcache in 21071-CA only— 21071-CA GCR<bc_En>=l, 
GCR<bc_l gnTag>=l. 

5. Clear all bits of the 21071-CA Tag Enable Register. 

6. Read all cache locations between location zero and (cache_size — 1 byte). 
Because bc_lgnTag is set, the DECchip 21071-CA will fetch data from 
memory and put it in the cache as a clean block with correct tag parity. 

Warning 



Reading an area of memory other than between location zero and 
(cache_size —1 byte) will result in leaving the Bcache and main 
memory in an incoherent state. 



7. Set the 21071-CA Tag Enable Register to the appropriate value based on 
system cache and memory size. 
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8. Clear the DECchip 21071-CA bcJgnTag— GCR<bc_lgnTag>=0. 

9. Enable Bcache in the Alpha 21064 microprocessor— BIU_CTL<bc_En>=l. 

10. Enable machine checks (if desired) AB0X_CTL<MCHK_EN>=1. 

4.6.2 Alternative Method to Initialize the Bcache 

1. Enablethe Bcache— BIU_CTL<BC_EN>=1 and 21071-CA GCR<t)C_En>=l, 
GCR<bc_lgnTag>=l, GCR<bc_NoAlloc>=l. 

2. Disable machine checks ABOX_CTL<JV| CH K_E N >=0. 

3. Clear all bits of the 21071-CA Tag Enable Register. 

4. Read all cache locations between location zero and (cache_size — 1 byte). 
(Due to random initialization of the Bcache Ram bits some of these reads 
will rarely hit in the Bcache.) 

5. Set the 21071-CA Tag E nable Register to the appropriate value based on 
system cache and memory size. 

6. Read all cache locations between location cache_size and 

((2 X cache_size) — 1 byte). All of these reads will result in a Bcache miss, 
and the 21071-CA chip will read uninitialized data from memory and put it 
in the cache as a clean block with correct tag parity. 

7. Clear 21071-CA bcl gn_Tag— GCR<bcl gn_Tag>= 0. 

8. Write something to all locations throughout the available memory. This 
will result in all of memory having correct data parity/ECC. 

9. Enable machine checks (if desired) AB0X_CTL<1VICHK_EN>=1. 
Note 



BIU_CTL and ABOX_CTL are registers in the Alpha 21064 
microprocessor. 
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5 

DECchip 21071-CA Transactions and 

Timing Diagrams 



This chapter describes the transactions that are supported by the 21071-CA 
chip on the sysBus interface and the memory interface. When a topic is 
discussed, refer to the associated timing diagram. 

5.1 sysBus Transactions 

The following sections describe the CPU, DMA, arbitration, and write speed 
transactions. 

5.1.1 CPU Transactions 

This section describes the CPU transactions. 

5.1.1.1 Idle 

When the CPU is idle, the 21071-CA chip prepares for the next CPU 
transaction. The cache controls, with the exception of sysEarlyOEEn, are 
disabled. This will enable the cache tags on a CPU read or write, and enable 
the cache data on a read. 
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5.1.1.2 Read Block 

This section describes the read block transactions. 

5.1.1.2.1 Cacheable With Victim The following table describes the cycles for 
a CPU read block transaction in cacheable space with a victim, as shown in 
Figure 5-1. 

Cycle Description 

A read block begins during the idle cycle. The address is becoming valid 
because the CPU is doing a probe of the Bcache. 

1 TheCPU requests a read block with cpuCReq<2:0>. Because sysEarlyOEEn 
was asserted, this triggers the assertion of bcDataOE and bcTagOE. 

2 The 21071-CA chip decodes sysAdr<33:5>and finds it in cacheable memory 
space. Also, the cache tag is available and indicates a victim must be 
processed. The first octaword of victim data is already on sysData<127:0>. 
To prepare for the rest of the victim, sysDataALEn is asserted, followed one- 
half cycle later by sysDataAH En. These will produce a one cycle pulse on 
bcDataA4 beginning on clk2F. To maintain the data output from the cache, 
sysEarlyOEEn is left asserted and sysDataOEEn is asserted. 

3 The second octaword of the victim is received. The 21071-CA chip prepares 
to drive the bus by deasserting the cache controls sysEarlyOEEn and 
sysDataOEEn. 

4 The read of the victim is complete. The cache tags are driven by the 
21071-CA chip with the tag information for the fill data (valid and clean). If 
the CPU requested a wrapped read, bcDataA<4> would be asserted for the 
first time. Figure 5-8 shows a wrapped LDx_L read. 

If the read is in the instruction stream, as indicated by cpuCWMask<2> 
being false, and the cache line was previously valid, the CPU internal 
Dcache is invalidated using cpuDl nvReq. 

5 The system may stall for any number of cycles waiting for the read data 
to be available, although in this example the read data is ready now. The 
read data is driven onto the data bus. Using cpuDRAck<2:0>, the data is 
acknowledged as OK. 

sysTagWE is asserted, which generates bcTagCtlWE and bcTagAdrWE to 
write the tags into the cache. 

SysDataWEEn is asserted, in turn generating bcDataWE, which writes the 
data into the cache. To prepare to write, the second octaword bcDataA<4> 
is asserted. Figure 5-1 shows a single write pulse of half the system clock 
width. The 21071-CA chip also supports a write pulse of twice that duration. 
(See Section 5.1.4.) 
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Cycle Description 



The second cxiaword is written witli sysDataWEEn, and again acl<nowledged 
as OK using cpuDRAcl«2:0>. bcDataA<4> is deasserted once we are done 
witli the write. The arbiter could decide that DMA will be granted the bus, 
as indicated by the unknown (X's) on cpuHoldReq and ioGrant. For more 
information about arbitration, see Section 5.1.3. 

The cycle is acknowledged with cpuCAck<2:0>, and the data drivers and 
cache controls are returned to their default state. It is not possible to assert 
cpuCAck<2:0> sooner, because the CPU data bus drivers could have created 
a bus contention with the memory output buffers. 

The transaction is complete and the next transaction is ready to begin. If 
the CPU won arbitration, sysEarlyOEEn will be asserted in the next cycle in 
preparation for the next transaction. If the 21071-DA chip won arbitration, 
this cycle is used for bus turnaround. 
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Figure 5-1 Timing of CPU Read Blocit, Cacheable, Victim 
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5.1.1.2.2 Cacheable Without Victim A read without a victim is similar 
to Figure 5-1. When the tag results are clean or invalid in cycle 2, the 
information read out of the cache is discarded. The transaction has the same 
length and control signals as the victim case described in Section 5.1.1.2.1. 

5.1.1.2.3 Noncacheable The following table describes the cycles for a CPU 
read block transaction in noncacheable space, as shown in Figure 5-2. 

Cycle Description 

I n read block to noncacheable space, the address is placed on the bus one 
CPU cycle (as little as 3 ns) before clklR. 

1 The CPU requests a read block with cpuCReq<2:0>. Because sysEarlyOEEn 
was asserted, this triggers the assertion of bcDataOE and bcTagOE. 

2 The 21071-CA chip decodes sysAdr<33:5> and finds it is in noncacheable 
memory space. The 21071-CA chip prepares to drive the bus so sysEarlyOEEn 
is deasserted, which deasserts bcDataOE and bcTagOE. 

3 SysTagOE is asserted to prevent the cache tags from floating. The 21071-CA 
chip waits for the cache data to tristate. 

4 The read data is ready and is driven onto the data bus. Using cpu- 
DRAck<2:0>, the data is acknowledged as noncacheable. CpuDlnvReq 
does not assert in noncacheable space. The CPU does not require more than 
an octaword of data; therefore, only one data transfer is required. 

5 The cyde is acknowledged with cpuCAck<2:0> and the data drivers are 
returned to their default state. 

6 The transaction is complete, and the next transaction is ready to begin. 
Note 



A read block with the cache disabled is similar to a noncacheable 
read. However, a full hexaword is returned, and OK will be sent on 
cpuDRack<2:0>so that the CPU will place the data in its Dcache or 
I cache. 
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Figure 5-2 Timing of CPU Read Block, Noncacheable 
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5.1.1.2.4 I/O Space The following table describes the cycles for a CPU read 
block transaction in remote I/O space, as shown in Figure 5-3. 

Cycle Description 

As I/O space is noncacheable, the address is placed on the bus one CPU cycle 
(as little as 3 ns) before clklR. 

1 The CPU requests a read block with cpuCReq<2:0>. Because sysEarlyOEEn 
was asserted, this triggers the assertion of bcDataOE and bcTagOE. 

2 The 21071-CA chip decodes sysAdr<33:5>and finds it in I/O space. To get 
the cache off the bus, while preventing the tag from floating, sysEarlyOEEn 
is deasserted and sysTagOEEn is asserted. 

3 The 21071-CA chip waits for the cache data to tristate. The 21071-DA chip 
processes the I/O read. 

4 The 21071-CA chip could return data in this cycle, but the data is not ready 
for two more cycles. 

5 The 21071-DA chip loads the merge and I/O read buffer on the 21071-BA 
chip using the eplBus. It indicates that the read data is loaded and can be 
sent to the CPU in the next cycle, so it requests a cpuDRAck<2:0> using 
ioCmd<2:0>. If more than one longword is being read, multiple epiData 
transfers are required; the last epiData transfer has the cpuDRack request. 

6 The read data is ready and is driven onto the data bus. The 21071-CA 
chip receives the cpuDRAck<2:0> request on ioCmd<2:0>and asserts 
cpuDRAck<2:0>as noncacheable. A CPU cyde acknowledge is requested 
using ioCmd<2:0>. 

7 The 21071-CA chip receives ioCmd<2:0>, tristates its data drivers, and 
acknowledges the cycle with cpuCAck<2:0>. The cache is turned off by 
deasserting sysTagOEEn. 

8 The transaction is complete, and the next transaction is ready to begin. 
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Figure 5-3 Timing of CPU Read Block, Remote I/O Space 
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5.1.1.3 Write Block 

This section describes tine write block transactions. 

5.1.1.3.1 Cacheable Allocate With Victim The following table describes the 
cycles for a CPU write block transaction in cacheable space with a victim and 
with write allocation enabled, as shown in Figure 5-4. 

Cycle Description 

A write block begins during the idle cycle. The address is becoming valid 
due to the CPU doing a probe of the Bcache. Systems may rely on cacheable 
address being set up for the time it takes the CPU to do a probe (a minimum 
of 10 ns). 

1 TheCPU requests a write block with cpuCReq<2:0>. Because sysEarlyOE En 
was asserted, this triggers the assertion of bcTagOE and cpuDOEJ. 

2 The 21071-CA chip decodes sysAdr<33:5>and finds it in cacheable memory 
space. The CPU sees the assertion of cpuDOEJ; the first octaword of write 
data is placed on the cpuData bus and is latched by the 21071-BA chip. The 
21071-CA chip asserts sysDOE to ensure that cpuDOEJ will not deassert 
too soon. The 21071-CA chip asserts sysTagOEEn to prevent the tag bus 
from floating. The 21071-CA chip also asserts cpuDWSel to get the second 
octaword of write data. The cache tag indicates that a victim must be 
processed. 

3 The CPU sees the assertion of cpuDWSel and places the second octaword of 
write data on the cpuData bus. The 21071-CA chip deasserts sysEarlyOEEn. 
The data is latched by the 21071-BA chip. The 21071-CA chip deasserts 
sysDOE and cpuDWSel. 

4 The sysData bus is tristated by the CPU. The 21071-CA chip as- 
serts sysDataOEEn causing the cache to begin driving the data bus. 
sysDataOEEn is asserted on clklF, rather than on the normal clk2F, to 
allow additional cache output enable access time. 

5 The first octaword of victim data is on sysData<127:0>and is latched by 
the 21071-BA chip. To prepare for the rest of the victim, bcDataA<4> is 
asserted. 

6 The second octaword of the victim is received. The 21071-CA chip prepares 
to drive the bus so that sysTagOEEn and sysDataOEEn are deasserted. 

7 The read of the victim is complete. The cache tags are driven by the 
21071-CA chip with the tag information for the fill data (valid and dirty). 
sysDataWEEn and sysTagWE are asserted to write the cycle data tags. 
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Cycle Description 



8 The fill data is ready and is driven on sysData<127:0>. If the CPU wrote a 
full cache line, the fill data is simply the same as the data written in cycle 2. 
Otherwise, the 21071-CA chip reads a line from memory and merges it with 
the write data to create an updated line of data. The CPU internal Dcache 
is invalidated using cpuDI nvReq. To prepare to write the second octaword, 
bcDataA<4>will change on clk2R because write timing is being used. 

9 The second octaword is written with sysDataWEEn. bcDataA<4> is 
deasserted after the write is done. 

10 The cycle is acknowledged with cpuCAck<2:0>, and the cache controls are 
returned to their default state. 

11 The transaction is complete, and the next transaction is ready to begin. 
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Figure 5-4 Timing of CPU Write Block, Cacheabte, Allocate, Victim 
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5.1.1.3.2 Cacheable Allocate Without Victim The following table describes 
the cycles for a CPU write block transaction in cacheable space without a 
victim and with write allocation enabled, as shown in Figure 5-5. 

Cycle Description 

A write block begins during the idle cycle. The address is becoming valid 
due to the CPU doing a probe of the Bcache. Systems may rely on cacheable 
address being set up for the time it takes the CPU to do a probe (a minimum 
of 10 ns). 

1 The CPU requests a write block with cpuCReq<2:0>. Because sysEarlyOE En 
was asserted, this triggers the assertion of bcTagOE and cpuDOE J. 

2 The 21071-CA chip decodes sysAdr<33:5>and finds it in cacheable memory 
space. The CPU sees the assertion of cpuDOEJ; the first octaword of write 
data is placed on the cpuData bus and latched by the 21071-BA chip. The 
21071-CA chip asserts sysDOE. The 21071-CA chip asserts sysTagOEEn to 
prevent the tag bus from floating. The 21071-CA chip also asserts cpuDWSel 
to get the second octaword of write data. The cache tag indicates no victim. 

3 The CPU sees the assertion of cpuDWSel and places the second octaword of 
write data on the cpuData bus. The 21071-CA chip deasserts sysEarlyOEEn. 
The data is latched by the 21071-BA chip. The 21071-CA chip deasserts 
sysDOE and cpuDWSel. The 21071-CA chip prepares to drive the bus so 
that sysTagOEEn is deasserted. 

4 The sysData bus is tristated by the CPU . The cache tags are driven by the 
21071-CA chip with the tag information for the fill data (valid and dirty). 

5 The fill data is ready and is driven on sysData<127:0>. If the CPU wrote a 
full cache line, the fill data is simply the same as the data written in cycle 
2. Otherwise, the 21071-CA chip reads a line from memory and merges 

it with the write data to create an updated line of data. If the old cache 
line was valid, the CPU internal Dcache is invalidated using cpuDl nvReq. 
sysDataWEEn and sysTagWE are asserted, in turn generating bcDataWE 
and bcTagWE, which write the data and tags into the cache. To prepare to 
write the second octaword bcDataA<4>is asserted. 

6 The second octaword is written with sysDataWEEn. 

7 The cycle is acknowledged with cpuCAck<2:0> and the data drivers are 
returned to their default state. cpuDOEJ is reasserted because the 
21071-CA chip is finished with the data bus. 

8 The transaction is complete, and the next transaction is ready to begin. 
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Figure 5-5 Timing of CPU Write Block, Cacheable, Allocate, No Victim 
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5.1.1.3.3 Cacheable No Allocate The following table describes the cycles 
for a CPU write block transaction with write allocation disabled, as shown in 
Figure 5-6. 

Cycle Description 

A write block begins during the idle cycle. The address is becoming valid. 
This transaction does not discriminate between cacheable and noncacheable, 
so the address is set up for only 4 ns. 

1 The CPU requests a write block with cpuCReq<2:0>. Because sysEarlyOEEn 
was asserted, this triggers the assertion of bcTagOE and cpuDOEJ. 

2 The 21071-CA chip decodes sysAdr<33:5> and finds it in cacheable memory 
space. The CPU sees the assertion of cpuDOEJ; the first octaword of write 
data is placed on the cpuData bus and is latched by the 21071-BA chip. 
The 21071-CA asserts sysDOE. The 21071-CA chip asserts sysTagOEEn to 
prevent the tag bus from floating. The 21071-CA chip also asserts cpuDWSel 
to get the second octaword of write data. 

3 The CPU sees the assertion of cpuDWSel and places the second octaword of 
write data on the cpuData bus. The 21071-CA chip deasserts sysEarlyOEEn. 
The data is latched by the 21071-BA chip. The 21071-CA chip deasserts 
sysDOE and cpuDWSel. The cache is disabled by deasserting sysTagOEEn. 
The cycle is acknowledged with cpuCAck<2:0>. 

4 The sysData bus is tristated by the CPU . 



DECchIp 21071-CA Transactions and Timing Diagrams 5-19 



Figure 5-6 Timing of CPU Write Block, Noncacheable or No Allocate 







CYO 


"V 




CY1 






CY2 


^ 


J~ 
r 


CY3 


"V 


CY4 


clkl 


' 


^ 


_7 

r 


^ 


J 


^ 


^ 


/ \ / 


clk2 


/ 




^ 


J 


J 


/ \ 








cpuHoldReq 




































cpuHoldAck 




\ 
























r 








ioGrant 




































loCmd 






idle 




















y 




^ 


























cpuCReq 


X 


idle 


X 


X write block 












X' Xidle 




























cpuCWMask 








X 




X write 


Tiask 










X: 






























cpuAdr 






X 


X write addresS' 












X' 






























cpuData 








"^> — 


-L. 


>c 


wdO 


^ 


)L 


wdl 


y-( 


drvSysData 




































A 




sysDOE 


1 




J 
































cpuDOEJ 








V 




\ 




J 
A 




cpuDWSel 


1 




J 


r 






























cpuCAck 




XokX \ 


cpuDRAck 


































"A 






















ioDataRdy 




































ioCAck 








X 


X 


Idle 














X 






























sysEarlyOEEn 








X: 


J 










v_ 


\ 




















sysTagOEEn 


/ 




J 


\_ 




"A 






















r 






bcTagCEOE 1 








V 

r 


\ 


— 


/ \ 






sysTagVDP 


X 






X 


X 








)—{ 










sysDataOEEn 
















































bcDataCEOE 1 


























\ 


cpuDinvReq 
































sysDataALEn 


X 


sysDataAHEn 


























X 


bcDataA<4> 




























sysDataWEEn 
























































bcDataWEJ 
sysTagWE 
























































bcTagCtlWEJ 




























sysCmd 








X 


X 


eset 


X 


X 


oad 


X 


X wrsy 


> X X wrsys X 






Idle 




CPU Write 




CPU Write 
and ARB 




CPU Write 1 


Terminate 



Note; 

ioRequest is not important during this transaction. 



LJ-03170-TI0 



5-20 DECchip 21071-CA Transactions and Timing Diagrams 



5.1.1.3.4 Noncacheable A write block transaction to noncaclneable space is 
identical to a write block witin write allocation disabled. See Section 5.1.1.3.3 
for a description of the transaction. 

5.1.1.3.5 I/O Space The following table describes the cycles for a CPU write 
block transaction in remote I/O space, as shown in Figure 5-7. 

Cycle Description 

During the entire time that the CPU has ownership of the bus, the 21071-DA 
chip, using ioLineSel<l:0>, provides a pointer to a free cache line buffer in 
the DMA read and I/O write buffer. 

1 The CPU requests a write block with cpuCReq<2:0>. Because sysEarlyOEEn 
was asserted, this triggers the assertion of bcTagOE and cpuDOEJ. 

2 The 21071-CA chip decodes sysAdr<33:5> and finds it in cacheable memory 
space. The CPU sees the assertion of cpuDOEJ, and the first octaword 

of write data is placed on the cpuData bus. The 21071-BA chip loads 
the data into the DMA read and I/O write buffer at the line selected by 
ioLineSel<l:0>. The 21071-CA chip asserts sysDOE. The 21071-CA chip 
asserts sysTagOEEn to prevent the tag bus from floating. The 21071-CA 
chip also asserts cpuDWSel to get the second octaword of write data. 

3 The CPU sees the assertion of cpuDWSel and places the second octaword 
of write data on the cpuData bus. The data is latched by the 21071-BA 
chip. The 21071-CA chip deasserts sysDOE and cpuDWSel. The 21071-DA 
chip, using ioCmd<2:0>, is ready to end the transaction in the next cycle, so 
cpuCAck is requested. 

4 The 21071-CA chip receives ioCmd<2:0> and acknowledges the cycle with 
cpuCAck<2:0>. The cache is turned off by deasserting sysTagOEEn. The 
21071-DA chip is free to unload the data using theepiBus. 

If the 21071-DA chip did not request a cpuCAck by this cycle, then 
sysDataOEEn will be asserted to prevent sysData<127:0>from floating. 

5 The transaction is complete, and the next transaction is ready to begin. 
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Figure 5-7 Timing of CPU Write Block, Remote I/O Space 
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5.1.1.4 LDx_L 

In general an LDx_L transaction looks like a read block. There are two major 
differences. The first is that the architecturally defined lock bit and lock 
address are set. The second is that in contrast to the read block transaction, 
the cache must be probed. (The Alpha 21064 microprocessor does not probe on 
LDx_L or STx_C.) 

5.1.1.4.1 Cacheable Hit Figure 5-8 shows an LDx_L transaction in cacheable 
space that hits. Data is not returned directly from the cache, to avoid an 
address-to-data race through the cache RAMs. Although theCPU should not 
issue one, a read block that hits in the cache will be treated as an LDx_L hit 
without the lock bit being set. 

The following table describes the cycles for an LDx_L transaction in cacheable 
space that hits, as shown in Figure 5-8. 

Cycle Description 

An LDx_L begins during the idle cycle. The address is becoming valid one 
CPU cycle before clklF, because the CPU did not probe the cache. 

1 TheCPU requests an LDx_L with cpuCReq<2:0>. Because sysEarlyOE En 
was asserted, this triggers the assertion of bcDataOE and bcTagOE. 
Wrapped return data is requested by asserting cpuCWM ask<l>. 

2 The LDx_L locked bit is set, and the LDx_L locked address is loaded from 
sysAdr<33:5>. If the 21071-DA chip is sending CIrLock on ioCmd<2:0>, then 
the lock bit is not set, and it is forced to remain clear for as long as the 
CIrLock is being sent. 

The cache tag indicates a hit. SysDataAEn is asserted as the data must be 
returned in wrapped order. If the cache line is clean, data will be wrapped 
from the memory, as in a regular wrapped read operation. 

3 Data from the cache is loaded into the 21071-BA chip merge buffer. To 
prepare to read the first octaword (since it is wrapped), bcDataA<4> 

is deasserted. The 21071-CA chip prepares to drive the bus so that 
sysEarlyOEEn is deasserted. 

4 The second octaword is loaded into the 21071-BA chip merge buffer. 

5 The 21071-CA chip waits for the cache data to tristate. 

6 The merge buffer data is driven on sysData<127:0>and acknowledged with 
cpuDRack<2:0>. 

7 The second octaword is driven and acknowledged. 

8 The cycle is acknowledged with cpuCAck<2:0>and the data drivers are 
returned to their state. 

9 The transaction is complete, and the next transaction is ready to begin. 
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Figure 5-8 Timing of CPU LDx_L, Wrapped, Cacheable Hit 
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5.1.1.4.2 Cacheable Miss An LDx_L transaction that misses in cacheable 
space is similar to Figure 5-1. Also see the description in Section 5.1.1.2.2. 

5.1.1.4.3 Noncacheable An LDx_L transaction to noncacheable space is 
identical to a read block to noncacheable space (Section 5.1.1.2.3), except that 
the lock bit and lock address must be set. 

5.1.1.4.4 I/O Space An LDx_L transaction to I/O space is treated by the 
21071-CA chip as a regular read to I/O space (although the lock bit is set). An 
implementation may treat the LDx_L as a regular read block in I/O space, flag 
an error, or implement the LDx_L. 



5.1.1.5 STx_C 

In general, an STx_C transaction looks like a write block. Also, the transaction 
may be aborted by the lock bit being cleared. The 21071-DA chip may 
ensure that STx_C to memory always fails by using the CIrLock command 
on ioCmd<2:0>. For CIrLock to affect a CPU STx_C transaction, the CIrLock 
command must be asserted in or before the first cycle of the STxC transaction 
flow. For example, a DMA read miss transaction that needs to clear the lock 
flag must do so before one cycle after the ioCAck<l:0>for the DMA read. 
This is because an STx_C transaction may potentially start in the cycle after 
ioCAck<l:0>. 

5.1.1.5.1 Cacheable Hit The following table describes the cycles for an STx_C 
transaction to cacheable space that hits in the cache, as shown in Figure 5-9. 

Cycle Description 

An STx_C begins during the idle cycle. An address is placed on the bus one 
CPU cycle before elk IF. 

1 The CPU requests an STx_C with cpuCReq<2:0>. Because sysEarlyOE En 
was asserted, this triggers the assertion of bcTagOE and cpuDOEJ. 

2 The CPU sees the assertion of cpuDOE J; the first octaword of write data is 
placed on the cpuData bus and is latched by the 21071-BA chip. The 21071- 
CA chip recognizes the transaction and tests the LDx_L lock bit, which is set 
(success). The cache tag indicates a cache hit. The 21071-CA chip asserts 
sysDOE. The 21071-CA chip asserts sysTagOEEn to prevent the tag bus 
from floating. The 21071-CA chip also asserts cpuDWSel to get the second 
octaword of write data. 

3 The CPU sees the assertion of cpuDWSel and places the second octaword 
of write data on the cpuData bus. The data is latched. The 21071-CA chip 
deasserts sysDOE, cpuDWSel, and sysEarlyOEEn. 
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Cycle Description 



4 The sysData bus is tristated by the CPU . The 21071-CA chip asserts 
sysDataOEEn, causing the cache to begin driving the data bus. 

5 The first octaword of cache data is on sysData<127:0> and is latched by the 
21071-BA chip. To prepare for the rest of the data, bcDataA<4> is asserted. 

6 The second octaword of cache data is received. The 21071-CA chip prepares 
to drive the bus by deasserting sysEarlyOEEn and sysDataOEEn. 

7 The cache read is complete. The cache tags are driven by the 21071-CA chip 
with the tag information for the fill data (valid and dirty). 

8 The fill data is ready and is driven on sysData<127:0>. The fill data is a 
merge of the data read from the cache, overlaid by the quadword or longword 
written by the STx_C. The CPU internal Dcache is not invalidated, as the 
CPU handles this case itself. sysDataWEEn and sysTagWE are asserted, in 
turn generating bcDataWE and bcTagWE, which writes the data and tags 
into the cache. To prepare to write the second octaword, bcDataA<4> is 
asserted. 

9 The second octaword is written with sysDataWEEn. 

10 The cycle is acknowledged with cpuCAcl«2:0>, and the data drivers are 
returned to their default state. 

11 The transaction is complete, and the next transaction is ready to begin. 
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Figure 5-9 Timing of CPU STx_C Succeeds, Hit, Cacheable, Allocate 



CYO CY1 


CY2 CY3 CY4 CY5 


clk1 / \ / \ / \ / \ / \ / \ / 


Clk2 . / \ . / \ • / \ ■ / \ ■ / \ ■ / \ 


cpuHoldReq 






cpuHoldAck \ \ 


ioGrant 




lOCmd idle idle 




cpuCReq . X X idle X X STx G 




cpuCWMask ! X X Write mask 




cpuAdr X X STx C cac Address , , , 






( X wdo x: X wdi :> ( X cdo X 




drvSysData 


sysDOE 1 




1 1 \ \ ' 




cpuDOEJ : \ \ 
cpuDWSel 


t 1 \ : 


1 1 \ \ ■ - 


cpuCAck 




cpuDRAck 








loDataRdy X \ 








ioCAck ■ X X idle 








sysEarlyOEEn X / 


\ \ : 






sysTagOEEn 


/ / : 






bcTaqCEOE 1 • ■ \ \ 










X hit X X : 








sysDataOEEn 


/ ; 






bcDataCEOE 1 


\ \ 


cpuDinvReq 








sysDataALEn 


: / / 


sysDataAHEn 


: / 


bcDataA<4> 


/ / 


sysDataWEEn 








bcDataWEJ 
sysTaqWE 








bcTagCtlWEJ 




sysCmd X X reset 


< X load X X rnerge X X mergeX X load X 


Idle STx G Write 
Note: 


Write Data Write Data 1 BUS Cache Read 
Tag Probe Turnaround 



ioRequest is not important during this transaction. 



5-28 DECchip 21071-CA Transactions and Timing Diagrams 





r 


CY6 






CY7 






CY8 




CY9 


"V_ 




CY1 







CY11 


clk1 


^ 


/ 

"A 


S 


^ 




\ 




J~ 




"^ 


J~ 




-^ 


\ 






clk2 


J 




J V_ 






/ \ 










cpuHoldReq 


/ 


























































cpuHoldAck 


X 










































ioGrant 


/ 


























































ioCmd 
















idle 


















X 










































cpuCReq 


































X' 


X idle 








































cpuCWMask 


































X: 










































cpuAdr 


































X' 






)c 






r 






























cpuData 


X cdi X : y 


~^ 


X fdO 


_X_ 


— 


X 


fd 


-^ 


— 


— (, 














drvSysData 






J 










\ 






\ 




sysDOE 


































































cpuDOEJ 
cpuDWSel 












































































cpuCAck 


t 






IXOK)C 


\ 


cpuDRAck 






































































"\ 






ioDataRdy 


f 


/ 










\ 






































ioCAck 




idle 






















X 












sysEarlyOEEn 












































































sysTagOEEn 




^ 


\ ■ 








































bcTagCEOE 1 


r 


/ 


y- 

"A 




























X 












sysTagVDP 








"^ 






V,nD 




— 









— 


























sysDataOEEn 






\ 






































bcDataCEOE 1 


/ 


J 
























\ 
















cpuDinvReq 


/ 




/ 




\ 


\ 










"A 












r 












^ 






sysDataALEn 




\ 




J 




^ 

^ 




X 








:^ 




— 




















sysDataAHEn 


— 


J 


\ 


r 




J 
r 




\ 




X 


















bcDataA<4> 


/ 




\ \ 


/ 






^ 


\ 


r~ 






























sysDataWEEn 


/ 


/ 











\ 




\ 


































bcDataWEJ 
















\\ 


JJ 




U 


JJ 












sysTagWE 


/ 


/ 


\ 




\ 






















r 




















bcTagCtlWE_l 


Nc 

io 












:V 


\ 


— 


-J 
















sysCmd 


X 


ovly X 




X 


Dvly 


X 


X nop 


X 




X 


nop 


X 




X 


eset X 


X reset X 




Cache Read 1 

te: 

Request is not impor 


BUS 

Turnaround 
tant during thi 


Cache Fiii 
and ARB 
s transaction. 




Cache Fill 




Terminate 




Next Trans 
LJ'03131TIO 



DECchip 21071-CA Transactions and Timing Diagrams 5-29 



5.1.1.5.2 Cacheable Miss Figure 5-10 shows an STx_C transaction to 
cacheable space that misses the cache. The figure shows only a write allocation 
and victim. See Figure 5-5 and Figure 5-6 for examples of no victim and no 
write allocation. 

The following table describes the cycles for a CPU STx_C transaction to 
cacheable space that hits in the cache, as shown in Figure 5-10. 

Cycle Description 

An STx_C begins during the idle cycle. An address Is placed on the bus one 
CPU cycle before elk IF. 

1 The CPU requests an STx_C with cpuCReq<2:0>. Because sysEarlyOE En 
was asserted, this triggers the assertion of bcTagOE and cpuDOE J. 

2 The CPU sees the assertion of cpuDOE J; the first octaword of write data is 
placed on the cpuData bus and is latched by the 21071-BA. The 21071-CA 
chip recognizes the transaction and tests the LDx_L lock bit, which is 

set (success). The cache tag indicates a cache miss. This cycle and the 
remaining cycles of the STx_C miss transaction are the same as write block 
cycles 2 and onward of Sections 5.1.1.3.1 (victim), 5.1.1.3.2 (no victim), and 
5.1.1.3.3 (write allocation disabled). 
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Figure 5-10 Timing of CPU STxC Succeeds, Miss, Caclieable, Allocate, Victim 
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Note: 

ioRequest is not important during this transaction. 
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ioRequest is not important during this transaction. 



5-32 DECchip 21071-CA Transactions and Timing Diagrams 



5.1.1.5.3 Noncacheable The following table describes the cycles for a CPU 
STx_C transaction to noncacheable space. This transaction lool<s the same as 
the noncacheable write blocl< transaction shown in Figure 5-6. 

Cycle Description 

An STx_C begins during the idle cycle. An address is placed on the bus one 
CPU cycle before clklF. 

1 The CPU requests an STx_C with cpuCReq<2:0>. Because sysEarlyOE En 
was asserted, this triggers the assertion of bcTagOE and cpuDOE J. 

2 The 21071-CA chip recognizes the transaction and tests the LDx_L 
lock bit, which is set (success). It decodes sysAdr<33:5> and finds it in 
cacheable memory space. This cycle and the remaining cycles of the STx_C 
noncacheable transaction are the same as noncacheable write block cycles 2 
and onward, which are described in Section 5.1.1.3.3. 

5.1.1.5.4 I/O Space Similar to LDx_L, an STx_C transaction to I/O space is 
treated by the 21071-CA chip as a write block to I/O space. An implementation 
may perform the STx_C or flag an error. 

5.1.1.5.5 Fail If the LDx_L lock bit is not set, or if the 21071-DA chip is 
sending CIrLock on ioCmd<2:0> (forcing the lock bit to remain clear), an STx_ 
C instruction will fail. 

The following table describes the cycles for a CPU STx_C fail transaction, as 
shown in Figure 5-11. 

Cycle Description 

An STx_C begins during the idle cycle. An address is placed on the bus one 
CPU cycle before clklF, as the CPU did not probe the cache. 

1 The CPU requests the transaction with cpuCReq<2:0>. Because sysEarlyOEEn 
was asserted, this triggers the assertion of bcTagOE and cpuDOEJ. 

2 The first octaword of data is received. The 21071-CA chip asserts sysDOE, 
cpuDWSel, and sysTagOEEn. 

3 The 21071-CA chip recognizes the transaction and tests the LDx_L lock bit, 
which is clear (fail). The latched write data is discarded. The 21071-CA chip 
deasserts sysEarlyOEEn and acknowledges the cyde with cpuCAck<2:0>. 

4 The transaction is complete, and the next transaction is ready to begin. 



DECchIp 21071-CA Transactions and Timing Diagrams 5-33 



Figure 5-11 Timing of CPU STx_C Fails 
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ioRequest is not important during this transaction. 
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5.1.1.6 Barrier 

The following table describes the cycles for a memory barrier transaction, as 
shown in Figure 5-12. 

Cycle Description 

A barrier begins during tlie idle cycle. An address is placed on the bus one 
CPU cycle before clklF, but is ignored. 

1 The CPU requests the transaction with cpuCReq<2:0>. Because sysEarlyOE En 
was asserted, this triggers bcDataOE and bcTagOE to turn on. (This is done 
to avoid having the data and tag buses float, because the CPU does not drive 
the data or tags during these transactions.) 

2 The 21071-DA chip recognizes the transaction and requests that an OK be 
sent using cpuCAcl«2:0> in the next cycle. The 21071-DA chip could also 
preempt the barrier at this point. The 21071-CA chip asserts sysDataOEEn 
and sysTagOEEn. 

3 The 21071-CA chip receives the request on ioCmd<2:0>, deasserts 
sysTagOEEn, sysDataOEEn, and sysEarlyOEEn, and acknowledges the 
cycle with cpuCAck<2:0>. 

4 The transaction is complete, and the next transaction is ready to begin. 
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Figure 5-12 Timing of CPU Barrier or Fetch or FetchM 



CYO CY1 CY2 CY3 CY4 


clk1 / \ / \ / \ / \ / \ / 


Clk2 . / \ . / \ / \ / \ / \ 


cpuHoldReq 




cpuHoldAck \ \ ■ 




iotarant , , , / 




loumd idle : X XcackcpuX X idle X 




cpuGHeq X X idle X X barrier • X' Xidle 




cpuAdr X : Xrandom address X: 








drvSysData 


sysDOE 




cpuDOEJ , , , , , 
cpuDWSel 




cpuCAck / XOKX \ 


rpiiHRArk 




loDataRdy ■ \ ■ 




loCAck : X X idle : X : 




sysEarlyOEEn X' / ' \ \ ' 




sysTagOEEn : / /• \ \ 




bcTagCEOE 1 : : \ \ : :/ / \ 










sysDataOEEn / : / \ : \ 




bcDataCEOE 1 ■ NX' 7 / \ 


cpuDlnvReq '.'.'.'.'.'. 




svsDataALEn ' / 


sysDataAHEn / 


bcDataA<4> ' / 


sysDataWEEn 




bcDataWE_l 
sysTaqWE 




bcTagCtlWE_l , , , , , 


sysCmd ■ y X reset X X nop X X nop X X nop X 


Idle : Start TRANS : CACK Request ! Terminate Next TRANS 



Note: 

ioRequest and cpuCWMask are not important during tills transaction. 
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5.1.1.7 Fetch, FetchM 

These CPU transactions, similar to tinose sinown in Figure 5-12, may 
be supported as desired by a particular implementation. The simplest 
implementation looks like an 5Tx_C fail. 

The following table describes the cycles for a fetch or fetchM transaction. 

Cycle Description 

A fetch or fetchM begins during the idle cycle. An address is placed on the 
bus one CPU cycle before clklF, but is Ignored. 

1 The CPU requests the transaction with cpuCReq<2:0>. Because sysEarlyOE En 
was asserted, this triggers bcDataOE and bcTagOE to turn on. (This is done 
to avoid having the data and tag buses float, because the CPU does not drive 
the data or tags during these transactions.) 

2 A wait state is performed. 

3 The 21071-CA chip recognizes the transaction, deasserts sysEarlyOEEn, and 
acknowledges the cycle with cpuCAck<2:0>. 

5.1.2 DMA Transactions 

After DMA wins arbitration, it may initiate a transaction with the 21071-CA 
chip. Unlike the CPU transactions, the only unit of transfer for DMA 
transactions is a cache line. 

5.1.2.1 DMA Idle 

When DMA has the bus, the CPU is isolated by holding cpuDWSel, and 
sysEarlyOEEn is deasserted. The cache is prepared for a probe by the 
21071-CA chip asserting sysDataOEEn and sysTagOEEn in the first cycle 
that a DMA transaction may begin. The cache also drives the data bus in case 
a DMA read or write hits the cache. 
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5.1.2.2 DMA Read 

This section describes the DIMA read transactions. 

5.1.2.2.1 Cacheable Hit The following table describes the cycles for a DIM A 
read transaction in cacheable space that hits, as shown in Figure 5-13. 

Cycle Description 

The transaction begins with DMA owning the bus, as indicated by the 
assertion of ioGrant. 

1 The 21071-DA chip requests a DMA read with ioCmd<2:0>, places the 
address on sysAdr<33:5>, and points to a line to be loaded in the DMA read 
and I/O write buffer with ioLineSel<l:0>. 

2 The 21071-CA chip decodes sysAdr<33:5> and finds it in cacheable memory 
space. The 21071-CA chip waits for the cache probe, which indicates a cache 
hit. The first octaword of data is already on the data bus, so it is loaded into 
the DMA read buffer. (If the read was wrapped, the data would be invalid 
and would have to be loaded in the next cycle.) To prepare for reading the 
second octaword, bcDataA<4> is asserted. 

3 The 21071-CA chip loads the second octaword of read data into the DMA 
read buffer and indicates data ready with ioDataRdy The transaction is 
acknowledged with ioCAcl«l:0>. If the 21071-DA chip won arbitration, 
it may start a new read transaction in the next cycle. If the CPU won 
arbitration, this cycle is used for bus turnaround. 

4 The transaction is complete, and the next transaction is ready to begin. 
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Figure 5-13 Timing of DMA Read, Cacheable, Hit 
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5.1.2.2.2 Cacheable Miss The following table describes the cycles for a DMA 
read transaction in cacheable space that misses, as shown in Figure 5-14. 

Cycle Description 

The transaction begins with DMA owning the bus, as indicated by the 
assertion of ioGrant. 

1 The 21071-DA chip requests a DMA read with ioCmd<2:0>, places the 
address on sysAdr<33:5>, and points to a line to be loaded in the DMA read 
and I/O write buffer with ioLineSel<l:0>. 

2 The 21071-CA chip decodes sysAdr<33:5>and finds it in cacheable memory 
space. Also, the cache tag, available this cycle, indicates a cache miss. 

3 The read data could be returned to the 21071-DA chip in this cycle, although 
it is shown to take until cycle 5. If the arbitration allows a release, and 
the 21071-CA chip is not in the middle of a preemption, the CPU may be 
released to use the cache. If so, the 21071-CA chip deasserts cpuHoldReq, 
sysTagOEEn, and sysDataOEEn. Section 5.1.3.2.4 describes returning from 
a released CPU to a DMA transaction. 

4 The 21071-CA chip waits for read data to return. SysEarlyOEEn is asserted 
so that if the CPU starts an external transaction, the tag and sysData buses 
will not float. 

5 The first octaword of read data is loaded into the DMA read and I/O write 
buffer. The 21071-CA chip indicates the transfer by asserting ioDataRdy 

6 The 21071-CA chip waits for the second quadword of read data to return. 

7 The second octaword is loaded into the DMA read and I/O write buffer and 
is acknowledged with ioDataRdy The transaction is acknowledged with 
ioCAck<l:0>. 

8 The transaction is complete, and the next transaction is ready to begin. 
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Figure 5-14 Timing of DMA Read, Cacheable, Miss 
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5.1.2.2.3 Noncacheable A DMA read transaction to noncacheable space is 
similar to the cacheable miss shown in Figure 5-14. Due to internal timing 
issues, the probe cycle still exists, but the probe results are ignored. 

5.1.2.2.4 I/O Space DMA transactions are not supported to I/O space; they 
should be responded to as an error using ioCAck<l:0> (Figure 5-15). 

The following table describes the cycles for a DMA read, I/O space transaction, 
as shown in Figure 5-15. 

Cycle Description 

The transaction begins with DMA owning the bus, as indicated by the 
assertion of ioGrant. 

1 The 21071-DA chip requests a DMA read with ioCmd<2:0>, places the 
address on sysAdr<33:5>, and points to a line to be loaded in the DMA read 
and I/O write buffer with ioLineSel<l:0>. 

2 The 21071-CA chip decodes sysAdr<33:5> and finds it is in 1/0 space. The 
21071-CA chip turns on its sysData drivers for this one cycle to prevent a 
floating bus. 

3 The cycle is acknowledged as an error with ioCAck<l:0>. 

4 The transaction is complete, and the next transaction is ready to begin. 
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Figure 5-15 Timing of DMA Read, I/O Space (Error) 
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5.1.2.3 DMA Read Wrapped 

The transaction for DMA read wrapped is the same as that of DMA read. The 
return read data is returned with octaword 1, followed by octaword 0. This is 
done by asserting sysDataAEn for the first octaword and deasserting it for the 
second. 

5.1.2.4 DMA Read Burst 

The transaction for DMA read burst is the same as that of DMA read, except 
that the transaction includes a hint that the next transaction will be to the 
next cache line address. 

5.1.2.5 DMA Read Wrapped Burst 

The transaction for DMA read wrapped burst is the same as that of DMA read, 
except that it contains the next line hint in DMA read burst and includes the 
wrapping in DMA read wrapped. 

5.1.2.6 DMA Write 

A DMA write releases the cache when the memory write buffer is full and the 
write does not hit in the cache. 
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5.1.2.6.1 Cacheable Hit The following table describes the cycles for a DMA 
write transaction in cacheable space that hits, as shown in Figure 5-16. The 
cache is invalidated rather than updated. 

Cycle Description 

The transaction begins with DMA owning the bus, as indicated by the 
assertion of ioGrant. 

1 The 21071-DA chip requests a DMA write with ioCmd<2:0>, places the 
address on sysAdr<33:5>, and points to the DMA write buffer cache line 
with write data using ioLineSel<l:0>. 

2 The 21071-CA chip decodes sysAdr<33:5> and finds it in cacheable memory 
space. Also, the cache tag indicates a cache hit. The 21071-BA chips 
internally transfer the first octaword of DMA write data to the memory 
write buffer. To prepare to invalidate thecache, sysTagOEEn is deasserted. 

3 The cache tags are driven by the 21071-CA chip as invalid. The second 
octaword is transferred. 

4 The tags are written by asserting sysTagWE for one cycle. The cache data is 
not written. bc_LongWR does not affect this transaction. 

5 The 21071-CA chip tristates the tags. The transaction is acknowledged with 
ioCAcl«l:0>. (The acknowledgment could not be done in cycle 4 because the 
address was still required to do the invalidate.) 

6 The transaction is complete, and the next transaction is ready to begin. 



5-46 DECchip 21071-CA Transactions and Timing Diagrams 



Figure 5-16 Timing of DMA Write, Cacheable, Hit, Followed by DMA Read 
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5.1.2.6.2 Cacheable Miss The following table describes the cycles for a DMA 
write transaction in cacheable space, as shown in Figure 5-17. 

Cycle Description 

The transaction begins with DMA owning the bus, as indicated by the 
assertion of ioGrant. 

1 The 21071-DA chip requests a DMA write with ioCmd<2:0>, places the 
address on sysAdr<33:5>, and points to the DMA write buffer cache line 
with write data using ioLineSel<l:0>. 

2 The 21071-CA chip decodes sysAdr<33:5>and finds it in cacheable memory 
space. Also, the cache tag indicates a cache miss. The 21071-BA chips 
internally transfer the first octaword of DMA write data to the memory write 
buffer. 

If the cache is disabled (bc_EN =0), the tag probe results are ignored 
(assumes a miss), and the CPU internal Dcache is invalidated with 
cpuDinvReq. 

If the memory write buffer was full, the probe is completed and the write 
data is not transferred. If the probe missed, the arbitration may release the 
cache, and the transaction will continue when the memory write buffer is no 
longer full. 

3 The transaction is acknowledged with ioCAck<l:0>. (The acknowledgment 
could not be done in cycle 2 because the tag results were not available yet.) 

4 The transaction is complete, and the next transaction is ready to begin. 
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Figure 5-17 Timing of DMA Write, Cacheable, Miss, Followed by CPU Write 
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5.1.2.6.3 Noncacheable A DMA write transaction in noncacheable space is 
similar to a DMA write miss, as shown in Figure 5-17. Although the tag probe 
results do not matter, the timing of internal transfers and the acknowledgment 
are the same. The acknowledgment cannot be done in cycle 2 because of the 
time required to determine whether the transaction is to a valid memory 
location. 

5.1.2.6.4 I/O Space DMA transactions are not supported to I/O space; they 
should be responded to as an error using ioCAck<l:0>. This is shown in 
Figure 5-15 and is described in Section 5.1.2.2.4. 

5.1.2.7 DMA Write Masked 

A DMA write masked transaction is a combination of the DMA read and 
DMA write transactions. I n a DMA write masked transaction, the cache or 
memory is read as it is in a DMA read transaction. The results of the read are 
combined with the DMA write buffer and are loaded into the memory write 
buffer. 
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5.1.2.7.1 Cacheable Hit The following table describes the cycles for a 
DMA write masked transaction in cacheable space that hits, as shown in 
Figure 5-18. 

Cycle Description 

The transaction begins with DMA owning the bus, as indicated by the 
assertion of ioGrant. 

1 The 21071-DA chip requests a DJVl A write masked with ioCmd<2:0>, places 
the address on sysAdr<33:5>, and points to the DMA write buffer cache line 
with write data using JoLineSel<l:0>. 

2 The 21071-CA chip decodes sysAdr<33:5> and finds it in cacheable memory 
space. The 21071-CA chip waits for the cache probe, which indicates a 
cache hit. The first octaword of data is already on the data bus. The data is 
merged (based on the byte enables) with the DMA write buffer and is loaded 
into the memory write buffer. To prepare for reading the second octaword, 
bcDataA<4>is asserted. 

3 The cache tags are driven by the 21071-CA chip as invalid. The 21071-CA 
chip reads the second octaword of cache data, merges it, and places it into 
the memory write buffer. 

4 The tags are written by asserting sysTagWE for one cycle. 

5 The 21071-CA chip tristates the tags. The transaction is acknowledged with 
ioCAck<l:0>. (The acknowledgment could not be done in cycle 4 because the 
address was still required in order to do the invalidate.) 

6 The transaction is complete, and the next transaction is ready to begin. 
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Figure 5-18 Timing of DMA Write Masked, Cacheable, Hit 
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5.1.2.7.2 Cacheable Miss A DMA write masked transaction that misses the 
cache looks externally identical to a DMA read that misses the cache. This is 
described in Section 5.1.2.2.2. (The internal merge and transfer to the memory 
write buffer is invisible.) The cache may be released. 

5.1.2.7.3 Noncacheable A DMA write masked transaction to noncacheable 
space looks externally identical to a regular noncacheable DMA read. This is 
described in Section 5.1.2.2.3. 

5.1.2.7.4 I/O Space Any DMA transaction to I/O space is an error and is 
described in Section 5.1.2.2.4. 



5.1.2.8 DMA Flush 

A DMA flush transaction is used to ensure that the 21071-CA chip write buffer 
is empty. This may be required to guarantee the limited memory access time 
required by ISA and EISA devices. 

The following table describes the cycles involved for a DMA flush transaction, 
as shown in Figure 5-19. 

Cycle Description 

The transaction begins DMA owning the bus, as indicated by the assertion of 
ioGrant. 

1 The 21071-DA chip requests a DMA flush with ioCmd<2:0>and places an 
arbitrary address on sysAdr<33:5>. 

2 The 21071-CA chip checks to see if its write buffer is empty. I n this figure, it 
is not emptied for two cydes, so the 21071-CA chip waits. 

3 If the write buffer was empty, ioCAck<l:0> would be in this cycle. It is not, 
so the 21071-CA chip continues to wait. 

4 The 21071-CA chip continues to wait. 

5 The 21071-CA chip determines that its write buffer is empty, and the 
transaction is acknowledged with ioCAck<l:0>. 

6 The transaction is complete, and the next transaction is ready to begin. 
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Figure 5-19 Timing of DMA Flush 
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5.1.3 Arbitration Transactions 

This section describes tine arbitration transactions. 

5.1.3.1 Back-to-Back Transactions 

This section describes the CPU-to-CPU and DIMA-to-DIMA transactions. 

5.1.3.1.1 CPU-to-CPU Figure 5-20 shows the actions between two back-to- 
back CPU transactions. It shows a CPU cacheable read followed by a CPU 
write, although this description is applicable to any transaction. 

The following table describes the cycles for back-to-back transactions, as shown 
in Figure 5-20. 

Cycle Description 

A cacheable read block transaction is in progress, as described in cycle 6 of 
Section 5.1.1.2.1. 

1 I n the cyde that cpuCAck<2:0> is sent, the cache controls are set inactive, 
with sysEarlyOEEn, sysTagOEEn, and sysDataOEEn all deasserted. 

2 The previous transaction is done. To prepare for the next CPU transaction, 
sysEarlyOEEn is asserted. 

3 cpuDOEJ is asserted. A CPU write transaction is next, as described in 
cycle 1 of Section 5.1.1.3.1. 
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Figure 5-20 Switch From CPU Read to CPU Write 
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5.1.3.1.2 DMA-to-DMA The actions between two back -to-back DMA 
transactions are sinown in Figures 5-21 and 5-22. Figure 5-21 sinows a DIM A 
read Init followed by a DMA write, and Figure 5-22 siiows a DMA write init 
followed by a second DMA write. This description applies to any back-to-back 
DMA transaction. 

The following table describes the cycles for back -to-back DMA transactions 
shown in Figure 5-21. 

Cycle Description 

A DMA read miss transaction is in progress, as described in cycle 5 of 
Section 5.1.2.2.2. If not already in the proper state, sysDataOEEn and 
bcDataA<4> are deasserted. 

1 The DMA read miss transaction is finished with ioCAck<l:0> being sent. 

2 A DMA write transaction is next, as described in cycle 1 of Section 5.1.2.6.2. 

The following table describes the cycles for back-to-back transactions shown in 
Figure 5-22. 

Cycle Description 

A DMA write hit transaction is in progress, as described in cycle 3 of 
Section 5.1.2.6.1. 

1 The DMA write hit transaction is still in progress. 

2 The DMA write hit transaction is finished. ioCAck<l:0> = OK is sent. 

3 A DMA write transaction is next, as described in cycle 1 of Section 5.1.2.6.2. 
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Figure 5-21 Switch From DIUIA Read Hit to DMA Write 
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Figure 5-22 Switch from DMA Write Hit to DMA Write 
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5.1.3.2 Transitions 

This section describes tlie transition transactions. 

5.1.3.2.1 CPU-to-DMA Figure 5-23 shows when the arbiter decides that the 
sysBus will be granted to DMA and several signals must change their default 
states in preparation for the DMA transaction. 

The following table describes the cycles for a CPU read to DMA write 
transactions, as shown in Figure 5-23. 

Cycle Description 

A CPU read block cacheable with victim transaction is in progress, as 
described in cycle 5 of Section 5.1.1.2.1. The 21071-CA chip samples the 
ioRequest<l:0> signals for a request in this cycle. (This figure represents the 
earliest possible sampling, two cycles before a transaction is acknowledged 
on ioCAck<l:0>or cpuCAck<2:0>.) 

1 The arbiter decides that the 21071-DA chip will be granted the bus. While 
the read is finishing, cpuHoldReq and ioGrant are asserted. 

2 The 21071-DA chip detects the assertion of ioGrant, ignores any CPU 
transaction to its space, and waits for cpuHddAck to assert. 

3 The 21071-CA and 21071-DA chips wait for cpuHoldAck to assert. In the 
fastest case, cpuHoldAck asserts this cycle. 

4 In this cycle, the CPU issues cpuHoldAck and tristates its buses. 

5 The cache tags and data are enabled with sysTagOEEn and sysDataOEEn. 
The 21071-DA chip drives sysAdr<33:5>and places a DMA command request 
on ioCmd<2:0>. The details of the DMA write transaction are described in 
cycle 1 of Section 5.1.2.6.2. The 21071-CA chip receives the command and 
processes it. 
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Figure 5-23 Switch from CPU Read to DMA Write 
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5.1.3.2.2 DMA to CPU, Cache Not Released When the 21071-DA chip owns 
the sysBus and cache, and the arbiter is ready to grant the bus back to the 
CPU, the cache and CPU controls must switch back to their CPU defaults. 

The descriptions in the following tables apply to any back-to-back DMA 
transaction. 

The following table describes the cycles for a DMA write hit followed by a CPU 
write, as shown in Figure 5-24. 

Cycle Description 

A DMA write hit transaction is in progress, as described in cycle 4 of 
Section 5.1.2.6.1. The 21071-DA chip Indicates that it does not have 
additional DMA transactions to complete by sending idle on ioRequest<l:0>. 
(Or the CPU has priority and is requesting a cyde on cpuCReq<2:0>.) 

1 One cyde before the cycle ioCAck<l:0> asserts, the 21071-CA chip deasserts 
ioGrant and sysDataOEEn. 

2 The 21071-CA chip deasserts cpuHoldReq and sysTagOEEn. (In the figure, 
sysTagOEEn was already deasserted for the invalidate.) The 21071-DA chip 
detects the deassertion of ioGrant, tristates its address buffers, and waits for 
cpuHoldAck to deassert. 

3 CpuHoldAck deasserts in this cyde. 

4 The 21071-CA chip asserts sysEarlyOEEn and may begin processing the 
CPU transaction. 

5 The CPU write appears on sysData. 
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The following table describes the cycles for a DMA read hit followed by a CPU 
write, as shown in Figure 5-25. 

Cycle Description 

A DMA read hit transaction is in progress, as described in cycle 2 of 
Section 5.1.2.2.1. 

1 One cycle before the cycle ioCAcl«l:0> asserts, the 21071-CA chip deasserts 
ioGrant and sysDataOEEn. 

2 The 21071-CA chip deasserts cpuHoldReq and sysTagOEEn. (In the figure, 
sysTagOEEn was already deasserted for the invalidate.) The 21071-DA chip 
detects the deassertion of ioGrant, tristates its address buffers, and waits for 
cpuH oldAck to deassert. 

3 CpuHoldAck deasserts in this cycle. 

4 The 21071-CA chip asserts sysEarlyOEEn and may begin processing the 
CPU transaction. 
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Figure 5-24 Switch from DMA Write Hit to CPU Write 
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Figure 5-25 Switch from DMA Read to CPU Write 
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5.1.3.2.3 DMA to CPU, Cache Previously Released If the arbitration allows 
cache releases, the 21071-CA chip may have released the cache to the CPU 
after a DMA read or write. This is indicated by ioGrant and cpuHoldReq 
being deasserted during a DMA transaction. To grant the sysBus to the CPU, 
additional signals must be changed. This is shown in Figure 5-26 and is 
described in the following table. 

Cycle Description 

A DMA read miss transaction is in progress, as described in cycle 6 of 
Section 5.1.2.2.2. One cycle before ioCAcl«l:0> asserts, the 21071-CA chip 
decides that the CPU has won arbitration. 

1 After the cycle JoCAck<l:0> asserts, the 21071-CA chip may begin processing 
the CPU transaction. 

2 The CPU transaction begins. 

3 The CPU transaction continues. 

5.1.3.2.4 DMA to DMA, Cache Previously Released To grant the cache back 
to the 21071-DA chip after a release, the CPU must be forced off the cache. 
This is shown in Figure 5-27 and is described in the following table. 



Cycle Description 



A DMA read miss transaction is in progress, as described in cycle 6 of 
Section 5.1.2.2.2. The 21071-CA chip decides that the 21071-DA chip has 
won arbitration and asserts cpuHoldReq and ioGrant. 

1 The 21071-DA chip sees ioGrant asserted and waits for cpuHoldAck. 

2 The 21071-CA and 21071-DA chips wait for cpuHoldAck. I n the fastest case, 
cpuHoldAck asserts in this cycle. 

3 The 21071-CA chip asserts sysDataOEEn and sysTagOEEn. The 21071-DA 
chip sees cpuHoldAck asserted and begins the DMA write. The 21071-CA 
chip may start the cache probe for the DMA write. 

4 The DMA transaction continues. 
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Figure 5-26 Switch from CPU Released to CPU Write 
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Figure 5-27 Switch from CPU Released to DIVIA Write 
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5.1.3.3 Preemption 

Reads and writes to I/O space, and all barriers may be preempted by the 
21071-DA chip. A preemption causes the current CPU transaction to be 
suspended and causes DMA transactions to be performed. After the DMA 
transactions are complete, the suspended CPU transaction is resumed. 

5.1.3.3.1 I/O Write Preempted for DIUIA Write The following table describes 
the cycles for a write block transaction to remote I/O space that requires 
preemption, as shown in Figure 5-28. 

This section describes the details of the preemption. For details about the 
write block to I/O space, see Section 5.1.1.3.5; for details about the DMA read, 
see Figure 5-13. 

Cycle Description 

The bus is idle and is owned by the CPU. 

1 The CPU requests a read block to I/O space with cpuCReq<2:0>. 

2 The 21071-DA chip determines that the I/O read creates a deadlock condition 
and requests a preempt using ioRequest<l:0>. 

Note 



Preempt cannot be requested during an I/O write 
until the CPU data has been latched, otherwise that 
data will be lost. 



3 The 21071-CA chip receives the preempt and asserts cpuHoldReq and 
ioGrant. 

4 The 21071-DA chip receives ioGrant and waits for cpuHoldAck. 

5 The CPU happens to assert cpuHoldAck this cycle. 

6 The 21071-CA chip receives cpuHoldAck and turns the cache on with 
sysDataOEEn and sysTagOEEn. The 21071-DA chip places its transaction 
on the bus. It also determines that another DMA transaction will not 

be required inside the preempt, and it returns ioRequest<l:0>to idle (or 
request if a regular DMA is desired after the I/O write). 

7 The 21071-CA chip detects a cache hit and loads the DMA read and I/O 
write buffer with the data. 
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Cycle Description 



8 The 21071-CA chip loads the second octaword of data and acknowledges the 
DMA transaction on ioCAcl«l:0>. It samples ioRequest<l:0>and finds that 
the preempt no longer exists, and it deasserts ioGrant, sysTagOEEn, and 
sysDataOEEn. 

9 The 21071-DA chip sees the deassertion of ioGrant and tristates its drivers. 
It also sees ioCAck<l:0>and knows that the DMA transaction is complete. 
The 21071-CA chip deasserts cpuHoldAck. 

The 21071-DA chip sees that the DMA transaction was complete last cycle, 
and no more preemption is required, so it may request a cpuCAck on 
ioCmd<2:0> in this cycle. As the sysData drivers have not been enabled 
yet, a cpuDRAck<2:0> request on ioCmd<2:0> may not be sent until the next 
cycle. 

10 CpuHoldAck deasserts, and the 21071-CA chip sees cpuHoldAck deasserted. 

The 21071-CA chip enables its data bus drivers if an I/O read was 
preempted. The 21071-DA chip detects ioGrant and cpuHoldAck both 
deasserted and continues the preempted CPU transaction. The remaining 
cycles are the same as the regular non-preempted transaction, resuming 
where the preempt interrupted it. 
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Figure 5-28 Timing of CPU Write Block to I/O Space, Preempted by a DMA Read Hit 
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5.1 .4 Write Speed 

The 21071-CA chip supports two different speeds for writing the cache. A 
system must determine which speed is required based on the RAIM setup, hold, 
and pulse width constraints. 

Note 



Different PAL equations are required for each mode. 



The normal speed allows one octaword of data to be written each cycle. It 
is the default and is indicated by the bc_LongWr bit in the general control 
register being clear. Figure 5-29 shows the timing of two back-to-back writes. 
This mode is also used as the base for all of the transaction timing diagrams in 
this chapter. 

Figure 5-29 Timing of Regular Writes 
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Long writes allow one octaword of data to be written in two cycles. It is 
indicated by tlie bc_LongWr bit in the general control register being set. 
Figure 5-30 shows the timing of two bacl<-to-back writes. All transactions that 
are limited by the write speed and not by memory or I/O read throughput will 
be two cycles longer (except for DMA write invalidate, which is not affected by 
bc_LongWr). bcDataA<4>will be stable for both cycles. SysDataLongWE will 
pulse for one cycle and may be delayed to generate the write pulse. 

Figure 5-30 Timing of Long Writes 
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5.2 Memory Transactions 

This section describes tine transaction timing on the memory interface. 

5.2.1 Memory Read Followed by a Page Mode Memory Read 

The following table describes a memory read followed by a page mode read, as 
shown in Figure 5-31. 

Cycle Description 

The transaction starts when memclkR coincides with clk2R. The 21071-CA 
has started driving the row address on the memAdr<ll:0>pins, one-quarter 
cycle earlier. 

1 The 20171-CA asserts the appropriate memRASJ<8:0>or memRASBJ<8:0> 
lines after waiting for the row address setup time. This example uses a 
ROWSETUP value of 0. 

The memData drivers are turned off in cycle 1 because the current 
transaction is a read. 

2 The 20171-CA waits for the row address hold time to be satisfied. 

3 The 20171-CA commences driving the column address after waiting for the 
row address hold time. This example uses a ROWHOLD value of 1. 

4 The 20171-CA asserts memCASJ<3:0> after waiting for the column address 
setup time. This example uses a COLSETUP value of 0. 

The 21071-CA changes the memCmd<3:l> from NOP toRDIMM, indicating 
to the 21071-BA chips that memory data should be latched on the rising 
edge of memClk in cycle 6. This example uses a RDIyRow value of 2. 

The 21071-CA will change the column address to point to the next column 
after the column address hold time has been satisfied. This example uses a 
ColHold value of 1, causing the next column address to be driven in cycle 6. 

5 The memCASJ<3:0>pins remain asserted. 

6 The 20171-CA deasserts memCASJ<3:0>, because the value of RTCAS 
is in this example. Data is latched into the 21071-BA chips due to the 
command driven by the 21071-CA chip in cycle 4. 

7 The memCASJ<3:0> pins remain deasserted until the CAS prechargetime 
is satisfied. This example uses a TCP value of 1. 
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Cycle Description 



8 The21071-CA chip assa"ts memCASJ<8:0> because the CAS prechargetime 
has been satisfied. 

The21071-CA changes the memCmd<3:l> from NOP to RDIMM, indicating 
to the 21071-BA chips that memory data should be latched on the rising 
edge of memCII< of cycle 10. The delay between latching successive read data 
is internally calculated from other programmed parameters. 

The 21071-CA will change the column address to point to the next column 
after the Column address hold time has been satisfied. This example uses a 
CdHold value of 1. 

9 This cycle is similar to cycle 5. 

10 The 20171-CA deasserts memCASJ <3:0>. Data is latched into the 21071-BA 
chips due to the command driven by the 21071-CA chip in cycle 8. 

The 21071-CA keeps memRASJ asserted because it decides to perform the 
next read in page mode. If the 20171-CA chip were to decide not to remain 
in page mode, it would deassert memRASJ in this cycle. 

The 20171-CA begins driving the next row address on memAdr<ll:0> 
because the read transfers have ended. 

The 21071-CA drives the memCmd<3:l>to RESET, indicating to the 
21071-BA chips that their internal counters and pointers should be reset. 

11 The state machine is in idle, but clk2 is low in this cycle. The state machine 
waits until the next cycle to begin, where clk2 is high. This is required to 
synchronize two state machines inside the 21071-CA chip. 

12 The next transaction is begun and is confirmed to be a page mode read. 

The 21071-CA switches the drvMemData pin to its default asserted value, 
causing the 21071-BA chips to turn on their memData drivers. 

13 The memData drivers are turned off in this cycle because the current 
transaction is a read. The 21071-CA begins to drive the first column address. 
This is a wait cycle for the state machine in the 21071-CA. 

14 The column setup time counter is started in this cycle and the 21071-CA 
waits to assert memCAS_l <3:0>. 

15 This cycle is similar to cycle 4. 



5-76 DECchip 21071-CA Transactions and Timing Diagrams 



Figure 5-31 Memory Read Followed by a Page Mode Memory Read 
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5.2.2 Memory Read Followed by a Non-Page Mode Memory Write 

Cycles through 5 are the same as in Section 5.2.1. In cycle 6, memCmd<3:l> 
is RDDly because the read data must be latched after 3 memClk cycles. 

Figure 5-32 shows a memory read followed by a non-page mode memory 
write. 
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Figure 5-32 Memory Read Followed by a Non-Page Mode Memory Write 
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All signals except memData are drawn at DECchip 21 071 -CA driver pin with zero delay. 
Non page Memory Read to Bank is followed by Write to Bank 1 . 
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5.2.3 Memory Write Followed by a Page Mode Memory Write 

The following table describes the cycles for a memory write followed by a page 
mode memory write transaction, as shown in Figure 5-33. 

Cycle Description 

The 21071-CA decides to begin a non-page-mode write in this cycle. 

1 The 21071-CA switches the address on memAdr<ll:0>from the default of 
read row address to write row address. The 21071-BA chips are already 
driving the first write data on the memData pins. The 21071-CA waits in 
this cycle to retrieve programmed information to do the write. 

2 The 21071-CA waits for the address setup time to memRASJ<8:0> (or 
memRASBJ<8:0>) to be satisfied. 

3 The 21071-CA asserts the appropriate memRASJ<8:0> (or memRASBJ<8:0>) 
after waiting for the row address setup time. This example uses a RowSetUp 
value of 0. 

4 The 20171-CA waits for row address hold time to be satisfied. 

5 The 20171-CA commences driving the column address after waiting for the 
row address hold time. This example uses a ROWHOLD value of 1. 

The 21071-CA asserts memWEJ<L:0>in the same cycle as switching from 
row to column address. memWE J<1:0> is held asserted until the end of the 
transaction. 

6 The 20171-CA asserts memCASJ<3:0> after waiting for the column address 
setup time. This example uses a COLSETUP value of 0. 

The 21071-CA changes the memCmd<3:l> from NOP toWRIMM, indicating 
to the 21071-BA chips that the next memory data should be driven on the 
rising edge of memClk of cycle 8. This example uses a WHoldORow value of 
4. 

The 21071-CA will change the column address to point to the next column 
after the column address hold time has been satisfied. This example uses a 
ColHold value of 1, causing the next column address to be driven in cycle 8. 

7 The memCASJ<3:0>pins remain asserted. 

8 The 20171-CA deasserts memCASJ<3:0> after waiting for the CAS assertion 
time. This example uses a WTCAS value of 0. 

The 21071-CA changes memCmd<3:l>to WRDLY_LAST, indicating to 
the 21071-BA chips that this is the last memData transfer and that the 
21071-BA should increment its write buffer cache line pointer after 3 cycles. 

9 The 20171-CA asserts memCASJ<3:0> after waiting for CAS precharge 
time. This example uses a TCP value of 0. 

10 This cyde is similar to cyde 7. 
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Cycle Description 



11 The 20171-CA deasserts memCAS_l <3:0>. 

The 21071-CA keeps memRAS_l asserted because it decides to do the next 
write in page mode. If the 20171-CA chip were to decide not to remain in 
page mode, it would deassert memRASJ (or memRASBJ) in this cycle. 

The 20171-CA begins driving the default read row address on memAdr<ll:0> 
because the write transfers have ended. 

The 21071-CA deasserts memWEJ<l:0>in this cycle. 

12 A page mode write is selected in this cycle. 

13 The 21071-CA switches the address on memAdr<ll:0>from the default of 
read row address to write column address. The 21071-BA chips are already 
driving the first write data on the memData pins. The 21071-CA waits in 
this cycle to retrieve programmed information to do the write. 

The 21071-CA asserts memWEJ<l:0>in this cycle. 

14 The 21071-CA waits for the address setup time to memCA5_l to be satisfied. 

15 This cycle is similar to cycle 6. 
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Figure 5-33 Memory Write Followed by a Page Mode Memory Write 
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All signals except memData are drawn at DECctiip 21 071 -CA driver pin witti zero delay. 
Non page Memory Write to Bank witti LW7 masked is followed by a page hit write. 



5.2.4 Memory Write Followed by a Non-Page Mode Memory Read 

The write portion of the transaction is the same as in Section 5.2.3. The 
difference is in cycle 12 when the write is completed. Because the default 
address sent out on mennAdr<ll:0> is the read address, no extra cycles are 
required to switch the address multiplexer (mux), when a read is selected. 
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The memRAS_l<8:0>for the read can assert as early as cycle 13. Figure 5-34 
shows a memory write followed by a non-page mode memory read. 

Figure 5-34 Memory Write Followed by a Non-Page Mode Memory Read 
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Non Page Memory Write to Bank with LW4 masked is followed by a Read. 
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5.2.5 Memory Refresh 

The following table describes the cycles for a memory refresh transaction, as 
shown in Figure 5-35. 

Cycle Description 

The 21071-CA decides to do a CAS-before-RAS refresh. The address is a 
don't-care during the refresh and continues to point to the default read row 
address. 

1 The 21071-CA asserts all memCASJ<3:0> signals. 

2 The 21071-CA waits to assert memRASJ and memRASBJ. This example 
uses a Ref_Cas2Ras value of 1. 

3 The 21071-CA asserts all memRASJ <8:0> and memRASBJ <8:0> signals. 

4 The 21071-CA waits to deassert memRASJ and memRASBJ. This example 
uses a Ref_RasWidth value of 1. 

5 This cycle is the same as cycle 4. 

6 The 21071-CA deasserts all memRAS_L and memRASBJ signals. This 
example uses a Ref_RasWidth value of 1. The RAS precharge count starts in 
this cycle. This example uses a RAS precharge value (gtr_rp) of 3. 

7 The 21071-CA deasserts memCASJ<3:0> signals in this cycle. 

8-10 The 21071-CA waits for the RAS precharge count (in this example, 3) to 

complete. 

11 The 21071-CA asserts RAS for the next transaction. 
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Figure 5-35 Memory Refresh 



cik2 r 

mClk r 



CYO 



CY1 



CY2 



CY3 



CY4 



CY5 



emAdr<11:0> ~ \ invalid" 



m 

memRAS_L<8:0> 

memRASB_L<8:0> 

memCAS_L<3:0> 



Address not relevant for CAS-RAS refresh 



nemWE_L £J~ 



drvmemData / 
memData ~ — 



- ( '~t X " 



"" X alio 



~ \ alio 







CY6 




CY7 




CY8 




CY9 


^ 


CY10 




CY11 




clk2 


/ 




-^ 


-^ 




"^ 




^ 


_7 


memClk 


/ 


\ 


\ 


-^ 


\ 


"^ 


\ 


\ 


\ 


_7 


memAdr<11 :0> 






X 






Row 






X 


Row nbxt transaction 
































rremRAS L<8:0> 




alii 








all 1 










X 


some RAS 
































memRASB L<8:0> 




all 1 








all 1 












































rremCAS L<3:0> 






X 


1111 












1111 








memWE L 




























drvmemData 
























































memData 





























Note: 

All signals except memData are drawn at DECchip 21071-CA driver pin with zero delay. 
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6 

DECchip 21071-CA Electrical Data 



This chapter includes the following information about the DECchip 21071-CA 
chip: 

• DC Electrical Data 

• AC Electrical Data 

6.1 DC Electrical Data 

This section describes the dc characteristics of the DECchip 21071-CA chip. 

6.1.1 Absolute Maximum Ratings 

Table 6-1 lists the maximum ratings of the DECchip 21071-CA chip. 
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Table 6-1 DECchip 21071-CA Maximum Ratings 



Characteristics IVIinimum Maximum 



Storage temperature 
Operating ambient temperature 

Air flow 

J unction temperature 



Supply voltage with respect to Vss, 
witii reset_l asserted 

Supply voltage witii respect to Vss, 
witii reset I deasserted 



Voltage on any pin witii respect to Vss 



Maximum power: 

(a/dd= 5.25 V IW 

(aCycle =30 ns 

^Ifpm = linear feet per minute 



55°C (-67°F) 


125°C (257°F 


0°C (32°F) 


40°C (104°F 


Ifpm^ 


— 


25°C (77°F) 


85°C (185°F 


0.5 V 


-f6.5 V 


4.75 V 


5.25 V 


0.5 V 


Vdd +0.5V 
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Table 6-2 lists the dc parametric values of the DECchip 21071-CA chip. 
Table 6-2 DC Parametric Values 



Symbol 


Description 


lUlinimum 


lUlaximum 


Units 


Test Conditions 


^ih 


Input high voltage 


2.0 


- 


V 


- 


^il 


Input low voltage 


- 


0.8 


V 


- 


^oh 


Output high voltage 


2.4 


- 


V 


- 


^ol 


Output low voltage 


- 


0.4 


V 


- 


'il 


1 nput leakage current-^ 


-5 


5 


HA 


OV <Vin <Vdd 


'ilpu 


1 nput leakage current^ 


-15 


-100 


iiA 


OV <Vin <Vdd 


'ilpd 


1 nput leakage current^ 


15 


100 


[XA 


OV <Vin <Vdd 


'ol 


Output leakage current 
(tristated) 


-10 


10 


HA 


OV <Vin <Vdd 



^Excluding memPDDIn, scanEnable, vFrame, vRefresh, wideMem, testModeand tristateL. 

^For tristateL, vFrame, and vRefresh. 

^For memPDDIn, scanEnable, testMode, and wideMem. 



6.2 AC Electrical Data 

This section describes the ac characteristics of the DECchip 21071-CA chip. 

6.2.1 Clocks 

The DECchip 21071-CA uses one clock (running at twice the nominal system 
frequency) plus a synchronous phase reference signal to generate five internal 
clock edges. See Figures 6-1 and 6-2, and Tables 6-3 and 6-4 for details 
about DECchip 21071-CA external clock requirements and internal clock phase 
relationships. 

A clock system must meet the requirements in Figure 6-1 and Table 6-4 
to guarantee the proper behavior of the 21071-CA chip's internal logic. The 
21071-CA chip does not specify the maximum skew allowed for external 
transfers to or from the CPU, Bcache PALs, Bcache, 21071-BA chips, or 
21071-DA chip because these skew limits are dependent on module placement 
and routing. A system designer must examine external transfers to determine 
the maximum clock skews allowed between chips. 
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The skew numbers shown in Figure 6-1 and Table 6-4 are given for a 30.0 
ns cycle time. At a longer cycle time, the allowable sl<ew may be increased, 
as long as the given minimum times between clock edges are not violated. 
These skew limits assume that the 21071-CA chip adds another 0.1 ns of 
uncertainty between rising and falling edges due to non-ideal input buffer 
switching thresholds. 



Table 6-3 DECchip 21071-CA Clock AC Characteristics 



Parameter 



Minimum 



Maximum 



Unit 



Note 



System cycle time 
clklx2 period 
clklx2 frequency 
clklx2 rise time 
clklx2 fall time 

clk2ref setup to clklx2 rising 
clk2ref hold from clklx2 rising 



30 


— 


ns 


cin Figure 6-1 


15 


— 


ns 


— 


— 


66 


MHz 


— 


— 


1 


ns 


— 


— 


1 


ns 


— 


0.43 


— 


ns 


Tsu in Figure 6-1 


2.32 


— 


ns 


Th in Figure 6-1 



Figure 6-1 DECchip 21071-CA Clock Skew Requirements 



sysClkOuH 
clk1 



Internal edges: clk1 R 

Internal memClk: 



/ / 



-Tsu- 



clk2R 
memClkR 



.5*c - 0.50 ns min - 
.5'c + 0.50 ns max - 



clklF 
/ / 



clk2F 
memClkR 



clk1R 



.5*c - 1 .25 ns min - 
.5*c + 1 .25 ns max - 



.75*c - 1 .60 ns min - 
.75*c + 1 .60 ns max - 



clk2R 
memClkR 



\ \ 



LJ-03719-TIO 
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Table 6-4 DECchip 21071-CA Clock Skew Limits at clk1x2 Pin 



Parameter 



Example Transfers 



Maximum 


Unit 


Note 


0.50 


ns 


@ Cycle 
30 ns 


1.25 


ns 


@ Cycle 
30 ns 


1.60 


ns 


@ Cycle 
30 ns 


1.60 


ns 


@ Cycle 
30 ns 



clklx2 rising edge to rising edge 

cll<lx2 falling edge to falling 
edge 

clklx2 rising edge to falling edge 
clklx2 falling edge to rising edge 



clklR toclklR, clklR toclklF, 
clklF toclklR, clklF toclklF 

clk2R toclk2R, clk2R toclk2F, 
clk2F toclk2R, clk2F toclk2F 

ClklR toclk2R, clklR toclk2F, 
ClklF toclk2R, clklF toclk2F 

clk2R toclklR, clk2R toclklF, 
clk2F toclklR, clk2F toclklF 



Figure 6-2 DECchip 21071-CA Clock Signals 



sysClkOuH 


/ 




\ 


— — 


/ 




\ 


/ 


Clk1x2 


/ 


\ 


u 


\ 


/ 


\ 


/ 


-^ / 


clk2ref 


/ 




\ 


/ 




\ 


*clk1R 


/ 

gene 


\ 


/ 


\ 


/ 


*clk2R 


/ 


\ 


/ 


\ 




*clk1F 


J. 


\ 


/ 


\ 


*clk2F 


/ 


\ 


_J \ 


•memClkR 
* Internally i 


/ 
irated clocks. 


\ 


/ 


\ 


/ 


\ 


/ \ 

LJ-03456-TI0 



The 21071-CA imposes no requirements on clkl or sysClkOutl. Skew on 
clkl will be constrained by limits imposed by external paths to or from the 
Bcache control PALs. The phase error between sysClkOutl and clklx2 will be 
constrained by limits imposed by external paths to or from the CPU chip. 
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6.2.2 Signals 

Figures 6-3 and 6-4 demonstrate the timing measurements specified in Tables 
6-6 and 6-7. 

Figure 6-3 DECchip 21071-CA Output Delay Measurement 



Input 



Output 1 
Output 2 



Delay_A 



Delay_B 



LJ-03561-TI0 
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Figure 6-4 DECchip 21071-CA Setup and Hold Time lUleasurement 




Set-up Hold 



Valid Signal 




LJ-03562-TI0 



The following ac electrical data is specified with respect to the appropriate edge 
at the clklx2 pin. Both the output delay table and the setup/hold time table 
assume a 1 ns edge rate at the clklx2 pin. 

All outputs drive a 50 pF load. When estimating module delays, you may 
need to replace the 50 pF load delay with a simulated (or calculated) delay. 
The delays for 4 mA and 8 mA drivers driving a 50 pF load are provided in 
Table 6-5. See Table 2-1 for information about the buffer size of every output 
pin. 



Table 6- 


-5 


DECchip 21071-CA Output Buffer Delays 


into a 


50 pF Load 


Type 




Minimum 


Maximum 


Unit 






4 mA 
8 mA 




3.5 
2.3 


7.6 
5.0 


ns 
ns 
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Table 6-6 DECchip 21071-CA AC Characteristics (Valid Delay into a 50 pF 
Load) 



Signal 



Minimum 



Maximum 



Unit 



Reference Edge 



sysData<15:0>^ 

tagAdr<31:17>, 
tagAdr P, 
tagCtlVDP 



5.9 
6.0 



19.1 
21.3 



ns 
ns 



clklR 
clklF 



cpuCAck<2:0>, 

cpuDRAck<2:0>, 

cpuDWSel<l>, 

cpuDlnvReq, 

cpuHoldReq 



4.8 



14.6 



ns 



clklR 



sysDOE, 
sysEarlyOEEn 

sysTagOEEn^ 

sysTagOEEn^ 

sysDataOEEn^ 

sysDataOEEn^ 

sysDataOEEn^ 

sysTagWE, 
sysDataWEEn 

sysDataLongWE 

sysDataALEn 

sysDataAHEn 



4.5 



11.6 



ns 



ClklR 



4.8 


11.5 


ns 


clklF 


4.9 


11.7 


ns 


ClklR 


4.9 


12.3 


ns 


clk2F 


4.9 


12.0 


ns 


ClklF 


4.9 


11.8 


ns 


ClklR 


4.3 


11.6 


ns 


ClklR 


4.5 


11.6 


ns 


ClklF 


4.6 


12.0 


ns 


clk2R 


4.7 


12.0 


ns 


clk2F 



ioGrant, 

ioCAck<l:0>, 

loDataRdy 



3.3 



12.1 



ns 



ClklR 



^ Two cydes are allocated for returning CSR read data. 

^ See Section 2.2.2.2 to determine which measurement is relevant. 

^ See Section 2.2.2.3 to determine which measurement is relevant. 



(continued on next page) 
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Table 6-6 (Cont.) DECchip 21071-CA AC Characteristics (Valid Delay into a 
50 pF Load) 



Signal 


Minimum 


Maximum 


Unit 


Reference Edge 


memAdr<ll:0>'* 


7.4 


16.1 


ns 


clklR 


memAdr<ll:0>^ 


9.1 


24.3 


ns 


clklR 


memAdr<ll:0>® 


5.4 


12.3 


ns 


memClkR 


manAdr<ll:0>^ 


5.4 


13.7 


ns 


memClkR 


memRAS 1<8:0>^, 
memRASBJ<8:0>^ 


4.7 


11.8 


ns 


memClkR 


memRAS 1<8:0>^ 
memRASBJ<8:0>^ 


4.0 


10.3 


ns 


memClkR 


memCASJ<3:0>^ 


4.9 


12.5 


ns 


memClkR 


memCAS_l<3:0>^ 


4.1 


12.5 


ns 


memClkR 


memCASJ<3:0>^° 


15.0 


— 


ns 


memClkR 


memWE I<1:0> 


4.9 


11.9 


ns 


memClkR 



memPDCIk, 
memPDLoad I 



5.1 



15.3 



ns 



clk2R 



memDTOEJ 

memDSF 

sysCmd<2:0> 



4.8 


11.7 


ns 


memClkR 


5.0 


12.5 


ns 


memClkR 


3.3 


11.7 


ns 


ClklR 



* Delay to valid row address for banksets through 7. Row addresses transition on clklR, 
one quarter-cycle before memClkR. Subtract (system cycle/ 4) from the numbers in this row to 
calculate the row address delay from memClkR. 

^ Delay to valid row address for bankset 8. Row addresses transition on clklR, one quarter-cycle 
before memClkR. Subtract (system cycle/ 4) from the numbers in this row to calculate the row 
address delay from memClkR. 

^ Delay on transition from row address to column address. 

~' Delay on transition from column address to subsequent column address. 

® Delay for falling edge of signal. 

5 Delay for rising edge of signal. 

1° Pulse width from rising to falling edge of signal. 



(continued on next page) 
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Table 6-6 (Cont.) DECchip 21071-CA AC Characteristics (Valid Delay into a 
50 pF Load) 



Signal 




Minimum 


lUlaximum 


Unit 


Reference Edge 


subCmdA<l:0>, 




3.3 


14.1 


ns 


clklR 


subCmdB<l:0> 












subCmdCommon 




3.3 


11.8 


ns 


clklR 


syslORead 




3.3 


12.0 


ns 


clklR 


sysReadOW 




3.3 


12.6 


ns 


ClklR 


drvSysData^ 




3.3 


13.4 


ns 


clk2R 


drvSysData^ 




3.3 


13.4 


ns 


clk2F 


drvSysCSR 




3.3 


14.3 


ns 


clk2R 


drvMemData 




4.8 


11.7 


ns 


memClkR 


memCmd<3:l> 




3.3 


12.4 


ns 


clk2R 


^ Delay for falling edge of 


signal. 










^ Delay for rising edge of ! 


signal. 










Table 6-7 DECchip 21071-CA AC Characteristics (Setup/Hold Time) 


Signal 




Setup 


Hold Unit 


Reference Edge 



sysData<15:0> 
sysAdr<33:5>^ 
sysAdr<33:5>^ 



0.4 


4.4 


ns 


clk2F 


12.2 


4.3 


ns 


ClklR 


9.7 


4.3 


ns 


clklF 



tagAdr<31:17>, 

tagAdrP, 

tagCtlVDP 



-0.4 



4.4 



ns 



clklF 



cpuCWIviask<7:0> 



7.7 — 

— 2.2 



ns 
ns 



ClklR 
ClklF 



^ For CPU transactions only. 
2 For DMA transactions only. 



(continued on next page) 
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1.8 


3.4 


ns 


clklF 


12.0 


— 


ns 


clklR 


-0.8 


3.9 


ns 


clklF 



Table 6-7 (Cont.) DECchip 21071-CA AC Characteristics (Setup/Hold Time) 
Signal Setup Hold Unit Reference Edge 

cpuCReq<2:0> 
cpuCReq<2:0>^ 
cpuHoldAck 



ioRequest<l:0>, -0.1 3.4 ns clklF 

ioCmd<2:0> 



memPDDln -0.8 4.4 ns clk2R 

^ In initial cycle of transaction; referenced to thedklR which receives sysAdr. 
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7 



DECchip 21071-CA Power-Up and 

Initialization 



This chapter describes the behavior of the DECchip 21071-CA chip on power-up 
and assertion of resetj. It also describes the system level requirements and 
the various registers that have to be initialized after reset_l is deasserted. 

7.1 Power-Up 

On power-up, the reset_l input of the DECchip 21071-CA chip should be 
asserted. It should be kept asserted until the system clocks are up and 
running for 20 cycles. 

7.2 Internal Reset 

The assertion and deassertion of the reset_l pin on the module is asynchronous 
to the DECchip 21071-CA. An internal reset signal is generated from reset_l 
which asserts asynchronously as soon as reset_l is asserted, but deasserts 
synchronously. Due to the synchronous deassertion of the internal reset, the 
DECchip 21071-CA requires that no external transaction should start until 10 
system clock cycles after the deassertion of reset_l. 

7.3 State of Pins on Reset Assertion 

The following are general rules and requirements for the behavior of the 
DECchip 21071-CA chip pins during reset: 

• All input only control signals (except the clocks and reset_l) must be in the 
deasserted state as long as reset is asserted. 

• All output only signals are deasserted. 

• All bidirectional signals are tristated. 
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The exceptions to these rules are as follows: 



sysDataOEEn and sysTagOEEn are asserted synchronously after the 
assertion of reset_l and are deasserted as soon as reset_l deasserts (without 
waiting for the deassertion of synchronous internal reset). These signals 
keep sysData<127:0>, sysCheck<7:0>, tagAdr<32:17>, and the tag control 
signals driven during reset. 

The presence detect logic activates on the deassertion of internal reset. For 
details of the operation, refer to Section 3.2.7. 

drvMemData is asserted by the DECchip 21071-CA so that 
memData<127:0>are driven by the 21071-BA during reset. 

Note 



In all cases, the assertion of tristate_l will override the assertion of 
reset_l. That is, if tristate_l is asserted during reset, all the outputs of 
the DECchip 21071-CA go to their H igh-Z state. 

If reset_l is still asserted when tristatej deasserts, the signals return 
to the normal reset state described previously. 



7.4 Configuration after Reset Deassertion 



Software must initialize the following registers in the DECchip 21071-CA after 
the deassertion of reset_l: 

General control register 

Tag enable register 

Bankset configuration registers 

To determine memory configuration, see Section 4.5. 

Bankset base address registers 

Bankset timing registers A and B 

To determine the programmed values of these registers, see Section 4.6. 

Global timing register 

Refresh timing register 
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The deassertion of internal reset causes the DECchip 21071-CA to commence 
doing refreshes. IMost DRAIMs require that they be refreshed 8 times before 
any write or read transactions are addressed to them. The DECchip 21071-CA 
does not guarantee this. Software has to ensure that memory reads and writes 
are not performed until the eight refreshes are completed. The refresh rate can 
be increased using two mechanisms: 

1. Software can use the force_Ref bit in the refresh timing register to generate 
back-to-back refreshes. I n this case, software has to write the force_Ref 
bit, wait 10 cycles for it to be cleared (indicating that one refresh has been 
completed), and then set it again for the next refresh. 

2. Software can also choose to set refj nterval in the refresh timing register 
at its minimum value of 64 memClk cycles (refjnterval =1). This will 
cause refreshes to happen every 32 system clock cycles. 

After initialization of the registers, the Bcache and memory must be written 
with good parity or ECC, otherwise errors may prevent correct operation. 
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Part II 



Part II contains six chapters that provide information about the DECchip 
21071-DA chip. The following table provides a brief description of each 
chapter: 



Chapter 



Description 



8 

9 

10 

11 

12 

13 



Describes the DECchip 21071-DA pin signals. 

Describes the DECchip 21071-DA architecture. 

Describes the DECchip 21071-DA control and status registers. 

Describes the transaction flows for the 21071-DA chip. 

Describes the DECchip 21071-DA electrical requirements. 

Describes the behavior of the DECchip 21071-DA chip during power-up. 



8 

DECchip 21 071 -DA Pin Descriptions 



This section provides a listing and description of pin signals for the DECchip 
21071-DA chip. 

The 21071-DA chip has three major bus interfaces: 

• sysBus 

• Peripheral Component Interconnect (PCI) 

• epIBus interface 

8.1 DECchip 21071-DA Pin List 

Table 8-1 lists the pin signals grouped by function. The information in the 
Type column identifies a signal as input (I), output (O), or bidirectional (B). 
The Buffer Strength column indicates the buffer drive strength. 

All output and bidirectional pins, except pTestout, can betristated. 
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Table 8-1 DECchip 21 071 -DA Pin List 



Signal Name 



Quantity Type 



Buffer 
Strengtii 



Function 



sysBus Signals 
(52 Total) 



sysAdr<33:5> 


29 


cpuCReq<2:0> 


3 


cpuCWM asl«7:0> 


8 


cpuHddAck 


1 


ioCmd<2:0> 


3 



ioCAck<l:0> 2 

loDataRdy 1 

ioLineSel<l:0> 2 

ioRequest<l:0> 2 

JoGrant 1 



4 ma Address bus 

— Cycle request 

— Cycle write mask 

— Hold acknowledge 

8 ma Command for DMA 

transactions; ac- 
knowledgment for I/O 
transactions. 

— Acknowledgment from 
the21071-CA chip on 
DMA transactions 

— Indicates that the 
requested data is loaded 
into the 21071-BA chips 
and can be extracted 

4 ma Selects which cache line 

should be read/written 
from thesysBus 

8 ma Request for DMA 

transactions on sysBus 

— Indicates that thesysBus 
has been granted to the 
21071-DA chip 

(continued on next page) 
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Table 8-1 (Cont.) DECchip 21071-DA Pin List 



Signal Name 


Quantity 


Type 


Buffer 
Strength 


Function 


PCI Signals 
(47 Total) 


AD<31:0> 


32 


B 


12/16 ma 


PCI data and address 
lines 


CBEJ<3:0> 


4 


B 


12/16 ma 


Bus command and byte 
enable 


F rameL 


1 


B 


12/16 ma 


Cycle frame 


TrdyL 


1 


B 


12/16 ma 


Target ready 


IrdyL 


1 


B 


12/16 ma 


Initiator ready 


StopL 


1 


B 


12/16 ma 


Stop the current 
transaction 



LockL 



Indicates an atomic 
operation tliat may take 
multiple transactions to 
complete 



DevselL 




B 


12/16 ma 


Device select 


Par 




B 


12/16 ma 


Parity bit 


PerrL 




B 


12/16 ma 


Parity error 


ReqL 







12/16 ma 


Bus request 


GntL 




1 


— 


Bus grant 


pClk 




1 


— 


PCI clock 


PCI Sideband Signals 
(2 Total) 


MemReql 


1 


1 


— 


Clears path from PCI to 
memory 


M emAckl 


1 





12/16 ma 


Acknowledgment that 
path for PCI to memory 
has been cleared by the 
21071-DA chip 



(continued on next page) 
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Table 8-1 (Cont.) DECchip 21071-DA Pin List 



Signal Name 



Buffer 
Quantity Type Strength 



Function 



epiBus Signals 
(46 Total) 



epiData<31:0> 


32 


B 


4 ma 


1 nterchip data for both 
DMA and I/O operations 


epiBEnErr<3:0> 


4 


B 


4 ma 


epiData byte enable 


epiOWSel 


1 





4 ma 


Selects which octaword 



epi Lines el<l:0> 



epiSelDMA 



epiFromlOB 



epiEnable<3:0> 



epiLinelnval 



of the cache line will 
be transferred on the 
epiData bus 

4 ma Selects which cache line 

will be transferred on 
the epi Data bus 

4 ma Selects which buffer 

(I/O or DMA) will 
be transferred on the 
epiData bus 

4 ma Selects the next epiData 

transfer from the 
21071-DA chip to the 
21071-BA chips 

4 ma Oualifies epiData control 

signals and enables 
output drivers 

4 ma Clears all byte valid bits 

in the current line of the 
DMA write buffer 

(continued on next page) 
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Table 8-1 (Cont.) DECchip 21071-DA Pin List 



Signal Name 



Quantity Type 



Buffer 
Strength 



Function 



Miscellaneous/Clock Signals 
(4 Total) 



IntHwO 



resetL 
clklx2 
clk2ref 



4 ma Interrupt to the DECchip 

21064 microprocessor 
indicating that the 
21071-DA chip has 
detected an abnormal 
condition 

1 — 21071-DA chip reset 

I — Clocl< input 

I — Phase reference for 

cll<lx2 



Test Signals 
(4 Total) 



testM ode 


1 


scanEn 


1 


tri state 1 


1 



pTestout 



4 ma 



Test mode select 

Scan Enable for chip 
testing 

Tri states all output and 
bidirectional pins for 
chip and module testing 

Parametric NAND tree 
output 



Pin Totals 

Total signal pins: 

Total power and ground pins: 

Total pins: 



155 
53 

208 
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8.2 DECchip 21071-DA Signal Descriptions 

This section provides pin signal information, including a description of the 
signal, the clock edge on which the signal changes, and rules about signal 
usage during various sysBus transactions. 

Signal descriptions are grouped by function and correspond to the pin list 
(Table 8-1). 

8.2.1 sysBus Signals 

This section describes the sysBus signals. 

8.2.1.1 sysAdr<33:5> 

Signal Type: 21071-CA Input, CPU output, 21071-DA bidirectional 
Input Sampling Clock Edge: clklR 
Output Clock Edge: clklR 

sysAdr<33:5> signals contain the cache line address of sysBus transactions; 
sysAdr<33:32> indicates the address quadrant. 

sysAdr<33:5> are driven by the CPU on CPU-initiated transactions and by the 
21071-DA chip on DMA transactions. 

• On CPU-initiated transactions, the cache line address is expected to be held 
on the bus from the command cycle through the terminate/acknowledge 
cycle. 

• On DMA transactions, the 21071-DA chip drives the address from the 
timecpuHoldAck and ioGrant are asserted until ioGrant or cpuHoldAck is 
deasserted. sysAdr<33:5> are valid when ioCmd<l:0> carry a valid DMA 
command. 

8.2.1.2 cpuCReq<2:0> 

Signal Type: 21071-DA Input 
Signal Source: CPU 
Output Clock Edge: clklF 

Whenever the DECchip 21064 microprocessor wants to initiate an external 
transaction, it puts a transaction type code onto cpuCReq<2:0>. Table 8-2 lists 
the encodings for the different transaction types. 
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Table 8-2 CPU-Initiated Transaction Encodings 



cpuCReq<2:0> 


Transaction 


000 


Idle 


001 


Barrier 


010 


Fetch 


on 


FetchM 


100 


Read block 


101 


Write block 


no 


LDx L 


111 


STx C 



The transaction types must be held on cpuCReq<2:0> until the end of the 
transaction. The 21071-DA chip does not latch these signals. 

Transactions on cpuCReq<2:0>are ignored by the 21071-DA chip when the 
bus is granted to 21071-DA chip. That is, from the cycle following ioGrant 
assertion to the cycle after cpuHoldAck and ioGrant deassertion at the end of 
the DMA transaction. 

8.2.1.3 cpuCWMask<7:0> 

Signal Type: 21071-DA I nput 

Signal Source: CPU 

Input Sampling Clock Edge: clklF 

cpuCWMask<7:0> signals are used on CPU-initiated read block and write block 
transactions. These signals carry different information on both read and write 
block transactions. 

On CPU write block and CPU STx_C transactions, cpuCWMask<7:0> carry 
the longword mask for the whole cache line. An asserted cpuCWMask signal 
indicates that the corresponding longword from the cache line is valid and 
should be written. 

On CPU read block and CPU LDx_L transactions, the cpuCWMask<7:0> sig- 
nals carry additional information about the read transaction. cpuCWMask<l:0> 
carry address bits <4:3>, thereby indicating the address of the actual quadword 
to be returned. This information is used to implement quadword granularity to 
I/O space. 
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8.2.1.4 cpuHoldAck 



Signal Type: 21071-DA Input 

Signal Source: CPU 

Input Sampling Clock Edge: clklF 

When cpuHoldAck is asserted in conjunction witl^ ioGrant, tine 21071-DA chip 
drives sysAdr<33:5> in the following cycle and may send out a valid DMA 
command on ioCmd<2:0>. 



8.2.1.5 ioCmd<2:0> 



Signal Type: 21071-DA Output 
Signal Destination: 21071-CA 
Input Sampling Clock Edge: clklF 
Output Clock Edge: clklR 

The 21071-DA chip asserts ioCmd<2:0>to request an action by the 21071-CA 
chip. When the 21071-DA chip owns the sysB us, ioCmd<2:0> signals are used 
to request a DMA transaction. When the CPU owns the bus, ioCmd<2:0> is 
used to request assertion of the cpuCAck and cpuDRAck signals. Table 8-3 
lists the encodings for ioCmd<2:0>. 

Table 8-3 ioCmd<2:0> Encodings 

ioCmd<2:0> CPU Owns sysBus 21071-DA Chip Owns sysBus 

000 Idle Idle 

001 CIrLock Flush 
010 cpuDRAck OK_NCACHE_NCHK Write 

Oil cpuDRAck OK_N CACHE Write masked 

100 cpuCAck OK Read 

101 cpuCAck HARD_ERROR Read burst 

110 cpuCAck SOFT_ERROR Read wrapped 

111 cpuCAck STxC_FAI L Read burst wrapped 



8.2.1.6 ioCAck<1:0> 



Signal Type: 21071-DA Input 
Signal Source: 21071-CA 
Input Sampling Clock Edge: clklF 
Output Clock Edge: clklR 
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The 21071-CA chip asserts ioCAck<l:0>to acknowledge a DMA transaction. 
ioCAck<l:0> indicates that the DIM A transaction has been completed. If any 
error occurs during the transaction, an error response is sent. Table 8-4 lists 
the encodings for ioCAck<l:0>. 

Table 8-4 ioCAck<1 :0> Encodings 
ioCAck<1 :0> Function 



00 


Idle 


01 


Reserved/unused 


10 


DMA cycle acknowledge 


11 


DMA cycle error 



8.2.1.7 ioDataRdy 



Signal Type: 21071-DA I nput 
Signal Source: 21071-CA 
Input Sampling Clock Edge: clklF 
Output Clock Edge: clklR 

When ioDataRdy is sampled asserted, the 21071-DA chip assumes that read 
data is available on epiData<31:7> in the following cycle. 

If the 21071-DA chip samples ioCAck<l:0> = DMA cycle acknowledge without 
a prior assertion of IoDataRdy, it assumes that all the data will be available 
in the 21071-BA chips on the second subsequent cycle, and it does not wait for 
IoDataRdy to assert. 



8.2.1.8 ioLineSel<1:0> 



Signal Type: 21071-DA Output 
Signal Destination: 21071-BA 
Output Clock Edge: clklR 
Input Sampling Clock Edge: clk2F 

ioLineSel<l:0> is driven by the 21071-DA chip to the 21071-BA chips. During 
DMA read transactions, ioLineSel<l:0> indicates the DMA read buffer line that 
should be loaded. During DMA write transactions, ioLineSel<l:0> indicates 
the DMA write buffer line that has to be written to memory. 

When the 21071-DA chip does not own the sysBus, the 21071-DA chip uses 
ioLineSel<l:0>to select the cache line of the I/O write buffer that should be 
loaded with CPU I/O write data. 
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8.2.1.9 ioRequest<1:0> 



Signal Type: 21071-DA Output 
Signal Destination: 21071-CA 
Input Sampling Clock Edge: clklF 
Output Clock Edge: clklR 

The 21071-DA chip asserts ioRequest<l:0> to request ownership of 
sysAdr<33:5>to perform a DIMA transaction. ioRequest<l:0> is acknowledged 
by the 21071-CA, using ioGrant. When a DIMA transaction is started, 
ioRequest<l:0> is returned to idle in the cycle after ioCmd, if no further 
DMA transactions are required. 

The 21071-DA chip uses the DMA request encoding on most DMA read and 
write transactions except in the following situations: 

• The 21071-DA uses an atomic request to perform a DMA read prefetch. 

• The 21071-DA uses an atomic request to perform a DMA read or write 
transaction following a scatter/gather map read. 

• The 21071-DA chip uses the preempt request in order to flush the DMA 
write buffer on memory barriers. 

• The 21071-DA chip uses the preempt request to prevent deadlock situations 
when an I/O transaction is stalled on the sysBus and a memory read 
targeted to the 21071-DA happens on the PCI , or when the write buffer is 
full. 

Table 8-5 lists the encodings for ioRequest<L:0>. 

Table 8-5 ioRequest<1:0> Encodings 
ioRequest<1 :0> Function 



00 


Idle 


01 


DMA preempt request 


10 


DMA request 


11 


DMA atomic request 



8.2.1.10 ioGrant 



Signal Type: 21071-DA Input 
Signal Source: 21071-CA 
Output Clock Edge: clklR 
Input Sampling Clock Edge: clklF 



8-10 DECchip 21071-DA Pin Descriptions 



The assertion of ioGrant indicates to the 21071-DA chip that it has won 
ownership of the sysBus. The 21071-DA chip does not begin any new CPU 
transactions unless both ioGrant and cpuHoldAcl< are asserted. If the 
21071-DA chip samples ioGrant deasserted in any cycle, it turns off its sysAdr 
drivers in the next clklR. 

The 21071-DA chip uses the ioGrant in combination with cpuHoldAcl< to 
determine if cpuCReq<2:0> should be ignored. 

8.2.2 PCI Signals 

For a detailed description of PCI interface pins, see the PCI Local Bus 
Specification 2.0. Table 8-6 provides a translation between the 21071-DA chip 
pin names and PCI specification signal names. 



Table 8-6 Translation of 21071-DA Pin Names to PCI Signal Names 



21071-DA Pin Name 



PCI Signal Name 



AD<31:0> 

CBEJ<3:0> 

F rameL 

TrdyL 

IrdyL 

StopL 

LockL 

DevselL 

Par 

PerrL 

ReqL 

GntL 

pClk 



AD <31:0> 

C/BE#<3:0> 

FRAME* 

TRDY# 

IRDY# 

STOP* 

LOCK* 

DEVSEL* 

PAR 

PERR* 

REO* 
GNT* 
CLK 



8.2.2.1 AD<31:0> 



Signal Type: Bidirectional (21071-DA, PCI devices) 
Input Sampling Clock Edge: pClkR 
Output Clock Edge: pClkR 

This signal indicates multiplexed PCI address and data bus. During an 
address phase of a transaction, AD<31:0> contains a physical byte address. 
During subsequent data phases, AD<31:0> contains data. 



DECchip 21071-DA Pin Descriptions 8-11 



A PCI bus transaction consists of one or two address phases followed by one or 
more data phases. The 21071-DA chip supports reads and writes and may act 
as initiator or target of a transaction on the bus. 



8.2.2.2 CBE l<3:0> 



Signal Type: Bidirectional (21071-DA, PCI devices) 
Input Sampling Clock Edge: pCII<R 
Output Clock Edge: pClkR 

This signal communicates multiplexed bus command and byte enables. During 
an address phase of a transaction, CBE_I<3:0> contains the bus command that 
defines the type of PCI transaction. During data phases, CBE_I<3:0> contains 
byte enables dictating which byte lanes carry valid data. CBE_l<0> applies to 
byte 0, CBE_I<3> applies to byte 3. 



8.2.2.3 FrameL 



Signal Type: Bidirectional (21071-DA, PCI devices) 
Input Sampling Clock Edge: pClkR 
Output Clock Edge: pClkR 

FrameL is driven by the initiator of the transaction to indicate the beginning 
and duration of an access on the PCI bus. FrameL assertion indicates the 
beginning of an access. While FrameL is asserted, data transfers continue. 
FrameL deassertion indicates the final data phase. The 21071-DA chip samples 
FrameL as an input and also drives FrameL when acting as the initiator of a 
transaction on the PCI bus. 



8.2.2.4 TrdyL 



Signal Type: Bidirectional (21071-DA, PCI devices) 
Input Sampling Clock Edge: pClkR 
Output Clock Edge: pClkR 

This signal indicates the target agent's ability to complete the current data 
phase of a transaction on the PCI bus. The 21071-DA chip drives TrdyL 
when acting as a target on the PCI bus and samples TrdyL when acting as an 
initiator on the PCI bus. 
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8.2.2.5 IrdyL 



Signal Type: Bidirectional (21071-DA, PCI devices) 
Input Sampling Clock Edge: pClkR 
Output Clock Edge: pClkR 

This signal indicates the initiator's ability to complete the current data phase 
of a transaction on the PCI bus. The 21071-DA chip drives IrdyL when acting 
as an initiator on the PCI bus and samples IrdyL when acting as a target on 
the PCI bus. 



8.2.2.6 StopL 



Signal Type: Bidirectional (21071-DA, PCI devices) 
Input Sampling Clock Edge: pClkR 
Output Clock Edge: pClkR 

This signal indicates that the current target is requesting the bus initiator to 
stop the current transaction on the PCI bus. The 21071-DA chip may drive 
StopL when acting as a target on the PCI bus, and it samples StopL when 
acting as an initiator on the PCI bus. 



8.2.2.7 LockL 



Signal Type: 21071-DA I nput 

Signal Source: PCI Devices 

Input Sampling Clock Edge: pClkR 

LockL indicates an atomic operation that may require multiple transactions 
to complete. The 21071-DA may be locked, but it will never request a lock. 
The 21071-DA treats the entirety of system memory as a single resource for 
the purposes of PCI exclusive access. The 21071-DA chip samples LockL when 
acting as a target on the PCI bus. 



8.2.2.8 DevselL 



Signal Type: Bidirectional (21071-DA, PCI devices) 
Input Sampling Clock Edge: pClkR 
Output Clock Edge: pClkR 

The 21071-DA chip asserts DevselL through positive decoding of the address 
on AD<31:0>. The 21071-DA chip asserts DevselL when it is accepting a 
transaction for system memory. The 21071-DA chip samples DevselL when 
it is acting as an initiator on the PCI bus, and it expects DevselL to be 
asserted within five cycles of FrameL assertion. Otherwise, the transaction 
is terminated with an initiator abort. 
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8.2.2.9 Par 



Signal Type: Bidirectional (21071-DA, PCI devices) 
Input Sampling Clock Edge: pClkR 
Output Clock Edge: pClkR 

The Par signal is even parity, calculated on 36 bits comprised of AD<31:0>and 
CBE_I<3:0>. The Par signal is generated for all address and data phases and 
is valid one clock cycle after valid data or address is driven on AD<31:0>. The 
Par signal is driven and tristated identically to AD<31:0>, except that it is 
delayed one clock cycle. The Par signal is driven by the 21071-DA chip when 
acting as an initiator during address phases and write data phases. The Par 
signal is driven by the 21071-DA chip when acting as a target during read data 
phases. The Par signal is sampled as an input during all address phases and 
when acting as a target during write data phases. 



8.2.2.10 PerrL 



Signal Type: Bidirectional (21071-DA, PCI devices) 
Input Sampling Clock Edge: pClkR 
Output Clock Edge: pClkR 

The PerrL signal is asserted when a data parity error is detected, and it 
corresponds to Par driven one clock cycle earlier. The 21071-DA chip may 
assert PerrL when it detects a write data parity error when acting as a target, 
or when it detects a read data parity error when acting as an initiator. 



8.2.2.11 ReqL 



Signal Type: 21071-DA Output 
Signal Destination: PCI Arbiter 
Output Clock Edge: pClkR 

The 21071-DA chip asserts ReqL to indicate to the bus arbiter that it wants to 
use the PCI bus. 



8.2.2.12 GntL 



Signal Type: 21071-DA Input 

Signal Source: PCI Devices 

Input Sampling Clock Edge: pClkR 

When GntL is asserted, it indicates to the 21071-DA chip that access to the 
PCI bus is granted. The 21071-DA chip may start a transaction as soon as 
GntL is asserted and the bus is idle. 
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8.2.2.13 pClk 

Signal Type: 21071-DA I nput 
Signal Source: External Logic 

The pClk signal provides timing for all transactions on the PCI bus. All PCI 
bus inputs are sampled on the rising edge of pClk, and all PCI bus outputs are 
driven from the rising edge of pClk. Frequencies supported by the bridge range 
from to 33 megahertz. 

8.2.3 PCI Sideband Signals 

This section describes the PCI sideband signals. 

8.2.3.1 MemReql 

Signal Type: 21071-DA I nput 
Signal Source: ISA/EISA bridge chip 
Input Sampling Clock Edge: pClkR 

This signal is asserted by ISA/EISA bridge chips to indicate that an ISA/EISA 
device requires guaranteed access time (2.1 fis) to main memory. Refer to 
Section 9.4.3 for details. This is a PCI sideband signal. 

8.2.3.2 MemAckl 

Signal Type: 21071-DA Output 
Signal Destination: External logic 
Input Clock Edge: pClkR 

This signal is asserted by the 21071-DA chip to indicate that guaranteed 
access time can be achieved on each subsequent PCI transaction directed 
toward main memory which is not retried by the 21071-DA chip. This is a PCI 
sideband signal. 

8.2.4 epIBus Signals 

This section describes the epIBus signals. 

8.2.4.1 epiData<31 :0> 

Signal Type: Bidirectional (21071-BA, 21071-DA) 

Output Clock Edge: clklR 

Input Sampling Clock Edge: clk2F 

epiData<31:0> is a 32-bit bidirectional bus which connects the 21071-DA and 
21071-BA chips. epiData<31:0> are driven on clklR and is tristated on clk2F. 
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8.2.4.2 epiBEnErr<3:0> 

Signal Type: Bidirectional (21071-BA, 21071-DA) 

Output Clock Edge: clklR 

Input Sampling Clock Edge: dk2F 

epiBEnErr<3:0> is timed with epiData<31:0>. During epiBus transfers to the 
21071-BA chips, this field indicates which bytes of the longword on the epiData 
bus are valid. When an epiBEnErr bit is asserted, the corresponding byte is 
valid. The byte enable is used for DMA write transfers and is ignored on I/O 
read transfers. 

During epiBus transfers from the 21071-BA chip DMA read and I/O write 
buffers, epiBEnErr<0> is asserted if the longword being sent on epiData 
contains a parity error or uncorrectable ECC error. epiBEnErr<l>is asserted 
if the longword being sent on epiData contained a correctable ECC error. 
Table 8-7 lists the epiBEnErr functions. 

Table 8-7 epiBEnErr Functions 

Signal Transfers to 21071-BA Transfers from 21071-BA 

epiBEnErr<0> epiData<7:0> byte enable DMA read I/O write uncor- 

rectable error (this longword) 

epiBEnErr<l> epiData<15:8> byte enable DMA read 1/0 write corrected 

error (this longword) 

epiBEnErr<2> epJData<23:16> byte enable Reserved 

epiBEnErr<3> epiData<31:24> byte enable Reserved 

8.2.4.3 epiAdr Signals 

epiOWSel, epiLineSel<l:0>, epiSelDMA, epIFromlOB, epiEnable<3:0>, and 
epiLinelnval are collectively referred to as the epiAdr bus. All these signals 
are set up one cycle prior to each epiData transfer to address a particular 
longword within the 21071-BA chip. A detailed description of each signal 
follows. 



Note 



epiEnable<3:0>, epiOWSel, epiLineSel, epiFromlOB, and epiSelDMA 
collectively address the contents of the 21071-BA chips. In a 
synchronous fashion, these address signals select data to be transferred 
in the subsequent cycle. 
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8.2.4.3.1 epiOWSel 

Signal Type: 21071-DA Output 

Signal Destination: 21071-DA, 21071-BA 

Output Clock Edge: clklR 

Input Sampling Clock Edge: dk2F 

epiOWSel is driven by the 21071-DA chip to the 21071-BA chips on epiBus 
transfers. It is asserted to select the upper octaword within the current 
hexaword cache line and is to be read or written using the epIData bus. 
Table 8-8 lists the longword selection. 

Table 8-8 Longword Selection 

Longword 21071-BA 

Desired Chip Number epiOWSel epiEnable<3:0> 

0001 
0010 
0100 
1000 
0001 
0010 
0100 
1000 

■"^The number in parenthesis indicates tine 21071-BA cliip number wlien four 21071-BA cliips are 
used in tine system. 



8.2.4.3.2 epiLineSel<1:0> 

Signal Type: 21071-DA Output 
Signal Destination: 21071-BA 
Output Clock Edge: clklR 
Input Sampling Clock Edge: clk2F 

epiLineSel<l:0> is driven by the 21071-DA chip to the 21071-BA chips. This 
field selects which cache line is sent from the DMA read and I/O write buffer 
to the 21071-DA chip or from the 21071-DA chip to the DMA write buffer 
using the epiData bus. This signal is ignored on 21071-DA to I/O read buffer 
transfers. 



LW 








LW 1 


1 





LW 2 


0(2)1 





LW 3 


1(3)1 





LW 4 





1 


LW 5 


1 


1 


LW 6 


0(2)1 
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1(3)1 


1 
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8.2.4.3.3 epiSelDMA 

Signal Type: 21071-DA Output 
Signal Destination: 21071-BA 
Output Clock Edge: clklR 
Input Sampling Clock Edge: clk2F 

The epiSelDM A signal is asserted by tine 21071-DA cinip to indicate to the 
21071-BA chips that the 21071-DA chip is performing a DIMA transfer (to the 
DIMA write buffer). When epiSelDIMA is deasserted, the 21071-DA chip is 
performing an I/O transfer (to the I/O read buffer). epiSelDM A is used to select 
the transfer, as shown in Table 8-9. 

8.2.4.3.4 epiFromlOB 

Signal Type: 21071-DA Output 
Signal Destination: 21071-BA 
Output Clock Edge: clklR 
Input Sampling Clock Edge: clk2F 

The epiFromlOB signal is asserted by the 21071-DA chip to the 21071-BA 
chips to indicate that the 21071-DA chip is performing a transfer from the 
21071-DA chip and to the 21071-BA chips. When epiFromlOB is driven 
low, the 21071-DA chip is performing transfer from the 21071-BA chips to 
the 21071-DA chip. epiFromlOB is used to select the transfer, as shown in 
Table 8-9. 

8.2.4.3.5 epiEnable<3:0> 

Signal Type: 21071-DA Output 
Signal Destination: 21071-BA 
Output Clock Edge: clklR 
Input Sampling Clock Edge: clk2F 

The epiEnable<3:0> signals are asserted by the 21071-DA to the 21071- 
BA to indicate that the 21071-DA is performing an epIBus transfer. When 
epIEnable is driven low, the epIData and epiBus control signals are ignored 
by the 21071-BA chips. Each bit of epiEnable<3:0> corresponds to one of four 
longwords. Table 8-9 lists the epiBus interface functions. 
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Table 8-9 21071-BA epiBus Interface Function 



epiEnable 


epiFromlOB 


epiSelDMA 


Function 





X 


X 


No action except for possible line 
invalidate; epiData tristated. 


1 





X 


The DMA read and I/O write buffer is 
driven onto epiData. 


1 


1 





epiData is loaded into the I/O read 
buffer. 


1 


1 


1 


epiData is loaded into the DMA write 
buffer. 



8.2.4.3.6 epILinelnval 

Signal Type: 21071-DA Output 
Signal Destination: 21071-BA 
Input Sampling Clock Edge: clk2F 
Output Clock Edge: clklR 

epILinelnval Is asserted during 21071-DA to 21071-BA transfers to Indicate 
that the cache line being loaded should be Invalidated. All byte enables for that 
line must be cleared. For the invalidate to take place, epIFromlOB Is asserted. 
(epIEnable must be Ignored.) epILinelnval Is asserted by the 21071-DA chip 
when the first longword of data Is loaded Into a new cache line from epiData. 

8.2.4.4 Miscellaneous/Clock Signals 

This section describes the miscellaneous and clock signals. 

8.2.4.4.1 intHwO 

Signal Type: 21071-DA Output 
Signal Destination: External logic 
Output Clock Edge: clklF 

The IntHwO Interrupt pin is an output from the 21071-DA chip and Is 
connected to one of the lrq<5:0> pins of the DECchIp 21064 microprocessor 
through the interrupt control/configuration PAL. This signal is asserted when 
the 21071-DA chip detects certain errors in the transactions it processes. 
IntHwO Is kept asserted until all such error conditions are cleared. 
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8.2.4.4.2 resetL 

Signal Type: 21071-DA I nput 
Signal Source: External logic 

Input Clock Edge: Asynchronous on assertion, clklR on deassertion 

Assertion of resetL sets all internal logic and state machines in the 21071-DA 
chip to their initialized states. 

8.2.4.4.3 clk1x2 

Signal Type: 21071-DA Input 
Signal Source: Clock generator 

clklx2 is a clock input which supplies a clock at twice the frequency of the 
DECchip 21064 sysClkOutl, with a minimum period of 15 ns, and a 50% duty 
cycle. 

8.2.4.4.4 clk2ref 

Signal Type: 21071-DA Input 
Signal Source: Clock generator 

clk2ref is a signal input which is low when the assertion of clklx2 corresponds 
to the assertion of sysClkOutl. The received signal must be set up to the 
assertion of clklx2. 

8.2.4.5 Test Signals 

This section describes the test signals. 

8.2.4.5.1 testMode 

Signal Type: 21071-DA I nput 

Signal Source: Test logic 

Input Clock Edge: Asynchronous 

Assertion of testMode places the chip into a mode for chip testing. testMode 
is intended to be used only during chip testing and must be tied low during 
normal system operation. 

testMode has a weak internal pull down and a Schmitt trigger input. 
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8.2.4.5.2 scanEn 

Signal Type: 21071-DA I nput 
Signal Source: Test logic 

Assertion of scanEn places all internal flops in their scan state. scanEn is 
intended to be used only during chip testing and must be tied low during 
normal system operation. 

scanEn has a weak internal pull down and a Schmitt trigger input. 

8.2.4.5.3 tristatej 

Signal Type: 21071-DA I nput 
Signal Source: External logic 
Input Clock Edge: Asynchronous 

Assertion of this signal tristates all output and bidirectional drivers. tristate_l 
is intended to be used only during chip testing and power-up. 

tristate_l has a weak internal pull up and a Schmitt trigger input. 

8.2.4.5.4 pTestout 

Signal Type: 21071-DA Output 
Signal Source: Test logic 
Output Clock Edge: Flow through 

The pTestout signal contains the output from the parametric NAND tree, as 
required for testability. The testMode signal must be asserted for pTestout to 
be valid. pTestout is intended for use only during chip testing. 

8.3 DECchip 21071-DA Pin Assignment 

The DECchip 21071-DA chip is a 208-pin plastic quad flat pack (PQFP). 
Figure 8-1 shows the signal assignments. Sections 8.3.1 and 8.3.2 provide 
alphabetical and numerical pin listings. 
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Figure 8-1 DECchip 21071-DA Pinout Diagram 
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8.3.1 DECchip 21 071 -DA Alphabetical Pin Assignment List 

Table 8-10 lists the DECchip 21071-DA pins in alphabetical order. The 
following list describes the abbreviations used in the Type column of the table. 

• B = Bidirectional 

• I = Input 

• P = Power 

• 0= Output 



Table 8-10 DECchip 21071-DA Alphabetical Pin Assignment List 



Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



AD<0> 


69 


B 


AD<1> 


67 


B 


AD<2> 


64 


B 


AD<3> 


50 


B 


AD<4> 


44 


B 


AD<5> 


42 


B 


AD<6> 


40 


B 


AD^> 


38 


B 


AD<8> 


34 


B 


AD<9> 


32 


B 


AD<10> 


31 


B 


AD<11> 


28 


B 


AD<12> 


26 


B 


AD<13> 


23 


B 


AD<14> 


21 


B 


AD<15> 


19 


B 


AD<16> 


196 


B 


AD<17> 


195 


B 


AD<18> 


192 


B 


AD<19> 


191 


B 


AD<20> 


190 


B 


AD<21> 


187 


B 


AD<22> 


185 


B 


AD<23> 


184 


B 


AD<74> 


181 


B 


AD<25> 


179 


B 



AD<26> 


176 


AD<27> 


174 


AD<28> 


173 


AD<29> 


170 


AD<30> 


169 


AD<31> 


167 


CBE<0> 


36 


CBE<1> 


18 


CBE<2> 


198 


CBE<3> 


182 


clklx2 


132 


clk2Ref 


135 


cpuCReqO> 


149 


cpuCReq-^> 


150 


cpuCReq<2> 


151 


cpuCWMaskO> 


178 


cpuCWMask<L> 


194 


cpuCWMask<2> 


201 


cpuCWMask<3> 


4 


cpuCWMask<4> 


5 


cpuCWMask<2> 


6 


cpuCWMask<a6> 


7 


cpuCWMask<^> 


8 


cpuHoldAck 


164 


DevselL 


10 


epiBEnErr<0> 


57 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



epiBEnErr<L> 


58 


B 


epiBEnErr-^> 


59 


B 


epiBEnErr<3> 


60 


B 


epiData<0> 


61 


B 


epiData<l> 


62 


B 


epiData'^> 


63 


B 


epiData<3> 


66 


B 


epiData<4> 


70 


B 


epiData-^> 


71 


B 


epiData<a6> 


72 


B 


epiData'^> 


73 


B 


epiData'^> 


74 


B 


epiData'^> 


75 


B 


epiData<^0> 


76 


B 


epiData<^l> 


77 


B 


epiData<^2> 


78 


B 


epiData<13> 


81 


B 


epiData<L4> 


82 


B 


epiData<L5> 


83 


B 


epiData<^6> 


84 


B 


epiData<^7> 


85 


B 


epiData-^S> 


86 


B 


epiData<L9> 


87 


B 


epiData'^0> 


88 


B 


epiData<21> 


90 


B 


epiData<22> 


91 


B 


epiData<23> 


92 


B 


epiData<24> 


93 


B 


epiData<25> 


94 


B 



epi Data'^6> 


95 


B 


epiData<^7> 


96 


B 


epi Data <^8> 


97 


B 


epi Data •^9> 


98 


B 


epi Data <30> 


99 


B 


epi Data <31> 


100 


B 


epiEnable^> 


29 





epiEnable<l> 


35 





epiEnable-^> 


39 





epiEnable<3> 


41 





epiFromiOB 


49 





epiLineinval 


48 





epiLineSelO> 


46 





epiLineSel<l> 


55 





epiOWSel 


56 





epi Set D IMA 


47 





FrameL 


200 


B 


GntI 


166 


1 


inpVdd 


51 


P 


inpVdd 


103 


P 


inpVdd 


155 


P 


inpVdd 


207 


P 


inpVss 


104 


P 


inpVss 


52 


P 


inpVss 


156 


P 


inpVss 


208 


P 


intHwO 


203 





ioCAclc<0> 


146 


1 


ioCAclc<l> 


147 


1 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



ioCmd<^> 


160 





ioCmd<l> 


161 





ioCmd-^> 


162 





ioDataRdy 


148 


1 


ioGrant 


145 


1 


ioLineSel<0> 


24 





ioLineSel<l> 


22 





ioRequestO> 


153 





ioRequest-^> 


154 





IrdyL 


3 


B 


LockL 


13 


B 


MemAckl 


159 





MemReql 


163 


1 


outVdd 


45 


P 


outVdd 


183 


P 


outVdd 


106 


P 


outVdd 


27 


P 


outVdd 


171 


P 


outVdd 


14 


P 


outVdd 


131 


P 


outVdd 


79 


P 


outVdd 


54 


P 


outVdd 


65 


P 


outVdd 


158 


P 


outVdd 


152 


P 


outVdd 


101 


P 


outVdd 


2 


P 


outVdd 


189 


P 


outVdd 


197 


P 


outVss 


80 


P 



outVss 


199 


P 


outVss 


180 


P 


outVss 


188 


P 


outVss 


11 


P 


outVss 


193 


P 


outVss 


168 


P 


outVss 


172 


P 


outVss 


102 


P 


outVss 


43 


P 


outVss 


30 


P 


outVss 


89 


P 


outVss 


177 


P 


outVss 


120 


P 


outVss 


186 


P 


outVss 


105 


P 


outVss 


141 


P 


outVss 


53 


P 


outVss 


37 


P 


outVss 


16 


P 


outVss 


1 


P 


outVss 


25 


P 


outVss 


20 


P 


outVss 


68 


P 


outVss 


157 


P 


outVss 


33 


P 


outVss 


175 


P 


outVss 


130 


P 


outVss 


205 


P 


Par 


17 


B 


pClk 


206 


1 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



PerrL 


15 


B 


pTestOut 


202 





ReqL 


165 





resetL 


204 


1 


scanEn 


136 


1 


StopL 


12 


B 


sysAdr'^> 


107 


B 


sysAdr<a6> 


108 


B 


sysAdr"i7> 


109 


B 


5ysAdr<£> 


110 


B 


sysAdr<S> 


111 


B 


sysAdr<10> 


112 


B 


sysAdr<ll> 


113 


B 


sysAdr<12> 


114 


B 


sysAdr<13> 


115 


B 


sysAdr<14> 


116 


B 


sysAdr<15> 


117 


B 


sysAdr<16> 


118 


B 


sysAdr<17> 


119 


B 


sysAdr<18> 


121 


B 


sysAdr<18> 


121 


B 


sysAdr<19> 


12? 


B 


sysAdr<^0> 


12? 


B 


sysAdr<^l> 


124 


B 


sysAdr<^2> 


125 


B 


sysAdr-^3> 


126 


B 



sysAdr<^4> 


127 


B 


sysAdr<25> 


128 


B 


sysAdr<26> 


129 


B 


sysAdr«27> 


137 


B 


sysAdr<28> 


138 


B 


sysAdr<29> 


139 


B 


sysAdr<30> 


140 


B 


sysAdr<31> 


142 


B 


sysAdr<32> 


143 


B 


sysAdr<33> 


144 


B 


testMode 


133 


1 


TrdyL 


9 


B 


tri State 1 


134 


1 
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8.3.2 Numerical DECchip 21071-DA Pin Assignment List 

Table 8-11 lists the DECchip 21071-DA pins in numerical order. The following 
list describes the abbreviations used in the Type column of the table. 

• B = Bidirectional 

• I = Input 

• P = Power 

• 0= Output 



Table 8-11 DECchip 21071-DA Numerical Pin Assignment List 



Pin Name 



Pin 



Type 



Pin Name 


Pin 


AD<12> 


26 


outVdd 


27 


AD<11> 


28 


epiEnable<0> 


29 


outVss 


30 


AD<10> 


31 


AD<9> 


32 


outVss 


33 


AD<8> 


34 


epiEnable-^> 


35 


CBE<0> 


36 


outVss 


37 


AD^> 


38 


epiEnable-^> 


39 


AD<6> 


40 


epiEnable<3> 


41 


AD<5> 


42 


outVss 


43 


AD<4> 


44 


outVdd 


45 


epiLineSel<0> 


46 


epiSelDMA 


47 


epiLinelnval 


48 


epiFromlOB 


49 


AD<3> 


50 



Type 



outVss 


1 


P 


outVdd 


2 


P 


IrdyL 


3 


B 


cpuCWMask<3> 


4 




cpuCWMask<4> 


5 




cpuCWMask<2> 


6 




cpuCWMask<6> 


7 




cpuCWMask-^> 


8 




TrdyL 


9 


B 


DevSelL 


10 


B 


outVss 


11 


P 


StopL 


12 


B 


LockL 


13 


B 


outVdd 


14 


P 


PerrL 


15 


B 


outVss 


16 


P 


Par 


17 


B 


CBE<1> 


18 


B 


AD<15> 


19 


B 


outVss 


20 


P 


AD<14> 


21 


B 


ioLineSel<l> 


22 





AD<13> 


23 


B 


ioLineSelO> 


24 





outVss 


25 


P 



B 
P 
B 
O 
P 
B 
B 
P 
B 
O 
B 
P 
B 
O 
B 
O 
B 
P 
B 
P 
O 
O 
O 
O 
B 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



inpVdd 


51 


P 


inpVss 


52 


P 


outVss 


53 


P 


outVdd 


54 


P 


epiLineSel<L> 


55 





epiOWSel 


56 





epiBEnErrO> 


57 


B 


epiBEnErr<l> 


58 


B 


epiBEnErr-^> 


59 


B 


epiBEnErr<3> 


60 


B 


epiDataO> 


61 


B 


epiData<L> 


62 


B 


epiData'^> 


63 


B 


AD<2> 


64 


B 


outVdd 


65 


P 


epiData<3> 


66 


B 


AD<1> 


67 


B 


outVss 


68 


P 


AD<0> 


69 


B 


epiData<4> 


70 


B 


epiData<2> 


71 


B 


epiData<6> 


72 


B 


epiData'^> 


73 


B 


epiData'^> 


74 


B 


epiData'^> 


75 


B 


epiData<^0> 


76 


B 


epiData-^l> 


77 


B 


epiData<^2> 


78 


B 



outVdd 


79 


P 


outVss 


80 


P 


epiData<13> 


81 


B 


epiData<14> 


82 


B 


epiData<15> 


83 


B 


epi Data<16> 


84 


B 


epiData<17> 


85 


B 


epi Data <18> 


86 


B 


epi Data <19> 


87 


B 


epi Data <^0> 


88 


B 


outVss 


89 


P 


epi Data <^1> 


90 


B 


epi Data <^2> 


91 


B 


epi Data •^3> 


92 


B 


epiData<^4> 


93 


B 


epiData<^5> 


94 


B 


epi Data'^6> 


95 


B 


epi Data <^7> 


96 


B 


epi Data <28> 


97 


B 


epiData<^9> 


98 


B 


epiData<30> 


99 


B 


epi Data <31> 


100 


B 


outVdd 


101 


P 


outVss 


102 


P 


inpVdd 


103 


P 


inpVss 


104 


P 


outVss 


105 


P 


outVdd 


106 


P 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



sysAdr«5> 


107 


B 


sysAdr<a6> 


108 


B 


sysAdr"i7> 


109 


B 


sysAdr-^> 


110 


B 


sysAdr<^> 


111 


B 


sysAdr<^0> 


112 


B 


sysAdr<^l> 


113 


B 


sysAdr<^2> 


114 


B 


sysAdr<^3> 


115 


B 


sysAdr<a4> 


116 


B 


sysAdr<^5> 


117 


B 


sysAdr<L6> 


118 


B 


sysAdr<^7> 


119 


B 


outVss 


120 


P 


sysAdr<^S> 


121 


B 


sysAdr«^9> 


17? 


B 


sysAdr<20> 


123 


B 


sysAdr<21> 


124 


B 


sysAdr<22> 


1?S 


B 


sysAdr<23> 


126 


B 


sysAdr<24 


127 


B 


sysAdr<25> 


128 


B 


sysAdr<26> 


129 


B 


outVss 


130 


P 


outVdd 


131 


P 


clklx2 


132 




testMode 


133 




tri5tate_l 


134 




clk2Ref 


13S 




scanEn 


136 




sysAdr'^7> 


137 


B 



sysAdr<^8> 


138 


sysAdr<^9> 


139 


sysAdr<30> 


140 


outVss 


141 


sysAdr<31> 


142 


sysAdr<32> 


143 


sysAdr<33> 


144 


ioGrant 


145 


ioCAck<0> 


146 


ioCAck<l> 


147 


ioDataRdy 


148 


cpuCReqO> 


149 


cpuCReq<L> 


150 


cpuCReq-^> 


151 


outVdd 


152 


ioRequestO> 


153 


ioRequest<L> 


154 


inpVdd 


155 


inpVss 


156 


outVss 


157 


outVdd 


158 


MemAckl 


159 


ioCmdO> 


160 


ioCmd<L> 


161 


ioCmd<2> 


162 


MemReql 


163 


cpuHoldAck 


164 


ReqL 


165 


GntL 


166 


AD<31> 


167 


outVss 


168 



p 
o 
o 
p 
p 
p 
p 
o 
o 
o 
o 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



AD<30> 


169 


B 


AD<29> 


170 


B 


outVdd 


171 


P 


OutVss 


172 


P 


AD<28> 


173 


B 


AD<27> 


174 


B 


outVss 


175 


P 


AD<26> 


176 


B 


outVss 


177 


P 


cpuCWMaskO> 


178 


1 


AD<25> 


179 


B 


outVss 


180 


P 


AD<?4> 


181 


B 


CBE<3> 


182 


B 


OutVdd 


183 


P 


AD<23> 


184 


B 


AD<22> 


185 


B 


outVss 


186 


P 


AD<21> 


187 


B 


outVss 


188 


P 


outVdd 


189 


P 


AD<20> 


190 


B 


AD<19> 


191 


B 


AD<18> 


192 


B 


outVss 


193 


P 


cpuCWMask<L> 


194 


1 


AD<17> 


195 


B 



AD<16> 


196 


B 


outVdd 


197 


P 


CBE<2> 


198 


B 


outVss 


199 


P 


FrameL 


200 


B 


cpuCWMask<^> 


201 


1 


pTestOut 


202 





intHwO 


203 





resetL 


204 


1 


outVss 


205 


P 


pClk 


206 


1 


inpVdd 


207 


P 


inpVss 


208 


P 



8.4 DECchip 21071-DA Mechanical Specifications 

Figure 8-2 shows DECchip 21071-DA package dimensions. 
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Figure 8-2 DECchip 21071-DA Package Dimensions 




DIM 


Millimeters 


Inches | 


MIN 


MAX 


MIN 


MAX 


A 


30.50 


30.77 


1.201 


1.211 


B 


27.90 


28.10 


1.098 


1.106 


C 


30.50 


30.77 


1.201 


1.211 


D 


27.90 


28.10 


1.098 


1.106 


G 


0.23 


0.33 


0.009 


0.013 


H 


.500 BSC 


0.0197 BSC 1 


J 


0.45 


0.62 


0.018 


0.024 


K 


3.45 


3.85 


0.136 


0.152 


L 


0.13 


0.23 


0.005 


0.009 


M 


0.25 


0.35 


0.010 


0.012 


R 


25.5 REF 


1.004 REF 


S 


25.5 REF 


1.004 REF 



LJ-03666-TI0 
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9 

DECchip 21 071 -DA Architecture Overview 



This chapter describes the 21071-DA architecture. The 21071-DA chip is a 
bridge between the PCI local bus and the Alpha 21064 microprocessor, its 
Bcache, and memory. The 21071-DA chip contains all the control functions of 
the bridge, as well as some data path functions. Other data path functions 
reside within the 21071-BA chip. 

The 21071-DA chip can be divided into two major sections: 

• sysBus (processor, memory) interface 

• PCI interface 

The following sections provide an overview of the architectural features of the 
sysBus and PCI interfaces. 

Figure 9-1 shows a block diagram of the DECchip 21071-DA chip. 
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Figure 9-1 DECchip 21071-DA Block Diagram 
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9.1 sysBus Interface Architecture 

The sysBus interface includes the sysBus control state machine, the address 
decode for CPU-initiated transactions, buffering for CPU-initiated transactions, 
and the control and status registers of the 21071-DA chip. 

9.1.1 Address Decode 

The 21071-DA chip provides logic for translating and extending the DECchip 
21064 34-bit physical address space into 32-bit PCI address space and vice 
versa. The address decode in the 21071-DA chip uses the address mapping 
and translation scheme described in Section 10.1 to generate PCI addresses on 
CPU-initiated transactions. All systems using the 21071-DA chip are required 
to follow this address mapping scheme. 
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9.1.2 Buffering for I/O Write Transactions 

The 21071-DA chip supports write-and-run I/O write transactions using a 
1-entry deep write buffer. The address and control mechanism are in the 
21071-DA chip; the corresponding data is stored in the 21071-BA chip. 

As soon as an I/O write transaction is received on the sysBus, the data and 
address is loaded in the write buffer and the transaction is acknowledged 
on the sysBus. Subsequent I/O transactions to the 21071-DA chip are not 
acknowledged until the previous I/O write transaction is completed. The I/O 
write could be directed towards the 21071-DA CSRs or the PCI bus. 

The 21071-BA chip provides a holding buffer to store write data for one 
subsequent write transaction. If the I/O write buffer is occupied, and another 
I/O write to the 21071-DA chip appears on the sysBus, the data of that write is 
captured from the sysBus and is loaded into the holding buffer. Even though 
the data is loaded into the holding buffer, the sysBus transaction is stalled 
until the I/O write buffer is free. The holding buffer is required so that all 
the write data can be captured before suspending the write transaction for 
deadlock resolution. See Section 9.4.2 for details. 

The description of the holding buffer and I/O write buffer is a conceptual one. 
I n the actual implementation there are two data buffers, and they alternate as 
I/O write and holding buffers. 

9.1 .3 Buffering for I/O Read Data 

The 21071-DA chip provides data buffering for one I/O read transaction 
initiated by the CPU. The I/O read buffer resides in the 21071-BA chip, but 
is controlled by the 21071-DA chip. The I/O read buffer is only a temporary 
holding buffer, and is invalidated at the end of every I/O read transaction. The 
I/O read buffer is loaded with data received from the PCI or the 21071-DA 
CSRs depending on whether the transaction is addressed to the PCI or the 
CSRs. The I/O read buffer is necessary to make the sysBus interface and PCI 
interfaces independent of each other. An I/O read may complete on the PCI, 
while the sysBus interface is busy flushing DMA writes to memory. (This is 
done by suspending the I/O read transaction using a preempt DMA request to 
the sysBus arbiter; see Sections 9.4.2 and 9.4.1 for details.) The I/O read buffer 
allows the PCI transaction to terminate without waiting for the read data to be 
returned to the CPU. 
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9.1.4 Wrapping for I/O Transactions 

The CPU must be configured in wrap mode for I/O reads to function correctly. 
The requested quadword is the only one that is returned on I/O read 
transactions. 

9.2 PCI Interface Architecture 

The PCI interface of the 21071-DA chip is a fully compliant PCI host bridge. 
It behaves as a master on the PCI on CPU-initiated transactions and is a 
target on memory space transactions initiated by other PCI masters. The 
architectural features of the PCI interface are described in the following 
sections. 

9.2.1 DMA Address Translation 

The PCI interface supports direct and scatter/gather mapping from the 32-bit 
PCI address to the 34-bit physical address space. It provides two windows 
which can be mapped to regions within the PCI address space. Each address 
region can be independently programmed to be direct mapped or scatter/gather 
mapped. 

If the address region is direct mapped, then the PCI address is directly sent 
out on the sysBus. Higher order sysBus address bits have to be obtained from 
the PCI base address registers in the 21071-DA chip. 

If the address region is scatter/gather mapped, the PCI address indicates the 
address of a page table entry, which contains the physical address of that 
page. Thus, there is a virtual (PCI address) to physical translation involved. 
The actual scatter/gather map is stored in memory. The 21071-DA chip 
accesses the map in memory to do all the required translation. To improve 
the performance of scatter/gather mapped DMA transactions, the 21071-DA 
chip implements an 8-entry translation lookaside buffer (TLB). Incoming 
PCI addresses to scatter/gather regions are looked up in the TLB. If there is 
a hit, the translation is done within the 21071-DA chip. If there is a miss, 
then the 21071-DA chip reads memory (through the sysBus) to obtain the 
required page table entry. The entry is then loaded into the TLB; a round- 
robin replacement scheme is used. The translation is done by the 21071-DA 
chip and the transaction is completed on the sysBus. 

For details about the actual mapping scheme and the page table entry format, 
see Chapter 10. 
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Note 

The slave machine of the DECchip 21071-DA PCI interface will not 
respond to a CPU-initiated address that has been driven onto the PCI 
by the master machine of the PCI interface, even if the address hits the 
programmed PCI DMA window. That is, the DECchip 21071-DA chip 
does not support loopback mode on the PCI . 



9.2.2 DMA Write Buffer 

The PCI interface has a write buffer for buffering DMA write data. The 
DMA write buffer is made up of four entries; each entry contains a cache 
line address, 8 longwords of data, the byte enables corresponding to each 
longword, a lock bit, a mask bit, a flush bit, and a valid bit for the entry. 
The untranslated PCI address is stored in the DMA write buffer. Address 
translation is performed when the particular entry is unloaded from the DMA 
write buffer. The address and control bits are stored in the 21071-DA chip, and 
corresponding data and byte enables are stored in the 21071-BA chip. 

Data is received on the PCI and is transferred to the 21071-BA chip over 
the epIData bus. When the transaction is completed on the PCI, the entry 
is marked valid and is available for unloading. A subsequent PCI write 
transaction to the same cache line will consume a separate write buffer entry. 
The 21071-DA chip does not support merging of write transactions. 

DMA reads are allowed to bypass the writes in the DMA write buffer, 
depending on the state of the dByp<l:0> bits from the DCSR. This improves 
average DMA read latency considerably, because most DMA reads are not 
expected to match addresses in the write buffer. When the dByp<l:0> mode 
indicates full bypass, read address bits <31:6> are compared with those of the 
buffered writes. If there is no match, the read is serviced ahead of the writes. 
When the dByp<l:0> mode indicates partial bypass, read bypass happens 
only if the read page offset does not match the write page offset; only address 
bits <12:6> are compared. This mode can be used if comparing virtual (PCI ) 
addresses between reads and writes is not desirable or could lead to coherency 
problems. In the No_Bypass mode, DMA reads are stalled until all the DMA 
writes have been flushed out of the write buffer. 
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There are two situations when read bypassing is disabled independent of the 
programmed value of dByp<l:0>: 

• The 21071-DA chip does not allow DMA reads to bypass buffered DMA 
writes if any of the buffered writes were locked by a PCI master. 

• The DMA write buffer has to be flushed to memory on memory barriers 
from the DECchip 21064 microprocessor to ensure data coherency. The 
21071-DA chip does not permit DMA reads to bypass buffered DMA writes 
while flushing the write buffer. See Section 9.4.1. 

If the DMA write buffer is full and a DMA write to memory is initiated on the 
PCI, the transaction is disconnected by the 21071-DA PCI interface without 
accepting any data. If the buffer is filled during a PCI DMA write transaction, 
the transaction is disconnected, and no more data is accepted by the 21071-DA 
PCI interface. 

9.2.3 DMA Read Buffer 

The 21071-DA chip controls the DMA read buffer located in the 21071-BA chip. 
The buffer stores up to 16 longwords of data organized as two cache lines. A 
valid bit is implemented along with each longword. Data received from the 
sysBus (memory or cache) is loaded into the DMA read buffer by the sysBus 
interface, and the corresponding valid bit is set. The data is unloaded by the 
PCI interface. 

The DMA read buffer does not require an address to be stored, because the 
contents of the buffer are invalidated at the end of the current PCI read 
transaction. There is n&/er any stale data in the DMA read buffer. 

9.2.4 PCI Burst Length and Prefetching 

The PCI interface supports a maximum burst length of 16 longwords on PCI 
write transactions directed toward main memory If the PCI write transaction 
starts on an even cache line boundary with PCI Address<5> = and PCI 
Address<4:2> = 0, a full burst of 16 longwords is supported. The transaction 
will be terminated using a PCI disconnect after the sixteenth longword 
has been received. In all other cases, the actual burst will be less than 16 
longwords. These cases are described here: 

• When a burst order other than linear incrementing is specified by the 
master, the transaction length is kept to one transfer. See Section 9.2.5. 

• When the transaction starts on an even cache line boundary, but PCI 
address <4:2> are non-zero. I n this case the first cache line is a partial 
write. 
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• When the transaction starts on an odd cache line boundary, PCI address 
<5> = 1. The burst length is this case is < 8 longwords. 

• If there is only one cache line entry available in the DMA write buffer, 
the burst is terminated after < 8 longwords of data have been transferred, 
even if the transfer started on an even cache line boundary. This is because 
after that cache line has been loaded into the write buffer, the buffer is full. 

On DMA read transactions, a maximum burst length of 8 longwords is 
supported if DMA prefetching is not enabled in the 21071-DA chip, and a PCI 
read multiple command was not used by the requesting device. A maximum 
burst length of 16 longwords is supported if DMA prefetching is enabled 
in the 21071-DA chip, or a PCI read multiple command was used by the 
requesting device. The following describes the various cases of DMA read burst 
transactions and indicates the burst length. 

• When a burst order other than linear incrementing is specified by the 
master, the burst length is kept to 1 longword. See Section 9.2.5. 
Prefetching is performed if prefetching is enabled or a read multiple 
command is specified on the PCI, and the transaction starts on an even 
cache line. 

• When prefetching is not enabled and the incoming PCI command is not a 
read multiple, and when the PCI transaction starts with PCI address<4:2> 
= 0, the PCI interface disconnects the transaction after 8 longwords have 
been transferred on the PCI . No prefetching is performed. 

• When prefetching is not enabled and the incoming PCI command is not 
a read multiple, and when the PCI transaction starts with a non-zero 
value on PCI address<4:2>, the PCI interface disconnects the transaction 
after < 7 longwords have been transferred on the PCI. No prefetching is 
performed. 

• When prefetching is enabled or a read multiple command is specified on 
the PCI, and the transaction starts on an even cache line boundary with 
PCI address<4:2> = 0, the PCI interface disconnects the transaction after 
16 longwords have been transferred on the PCI . The odd cache line is 
prefetched. 

• When prefetching is enabled or a read multiple command is specified on 
the PCI, and the transaction starts on an even cache line boundary with 
a non-zero value on PCI address<4:2>, the PCI interface disconnects the 
transaction after < 15 longwords have been transferred on the PCI. The 
odd cache line is prefetched. 
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• When prefetching is enabled or a read multiple command is specified on 
the PCI, and the transaction starts on an odd cache line boundary, the 
PCI interface disconnects the transaction after < 8 longwords have been 
transferred on the PCI. No prefetching is performed. 

On CPU-initiated read transactions, when the 21071-DA chip is a master on 
the PCI, a maximum burst length of 2 is supported. 

On CPU-initiated write transactions, when the 21071-DA chip is a master on 
the PCI, a maximum burst length of 2 is supported in sparse memory and I/O 
spaces, and a maximum burst length of 8 is supported in dense memory space. 

9.2.5 PCI Burst Order 

Bits <l:0>of the PCI address are used to specify the burst order requested by 
the master during memory transactions. When the 21071-DA is a master of 
the PCI, it will always indicate a linear incrementing burst order (AD<1:0> = 
0) on read and write transactions. 

On DMA transactions, the 21071-DA supports burst transfers only when a 
linear incrementing burst order is specified. If the master specifies a burst 
order other than that (AD<l:0>is non-zero), then the PCI interface disconnects 
the transaction after one data transfer. 

9.2.6 PCI Parity Support 

All PCI devices are required to generate parity across AD<31:0> (data and 
address lines) and C/BE#<3:0> (command/byte enables). The 21071-DA chip 
complies with this specification. 

When it is master of the PCI, it also checks the incoming parity on I/O reads, 
interrupt vector reads, and configuration reads during data phases. 

When it is a target on the PCI, it checl<s parity during the address phase, and 
during data phases on memory write transactions. 

9.2.7 PCI Exclusive Access 

The 21071-DA chip supports the PCI Exclusive Access protocol using the 
Locl<L signal. A locked transaction to main memory on the PCI causes the PCI 
interface to lock out all non-exclusive main memory accesses initiated by PCI 
masters. This is done by disconnecting the PCI transaction without completing 
any data transfers. Until the Lock is cleared on the PCI, only the PCI master 
that locked main memory is allowed to complete transactions to main memory. 
Refer to the PC/ Local Bus Specification for details. 
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On the sysBus side, the PCI lock causes the system lock flag to be cleared 
by using the ioClrLock command encoding on the ioCmd signals. The system 
lock flag is held cleared until all locked DMA reads and locked DMA writes to 
memory have been completed on the sysBus, and the Lock is cleared on the 
PCI. 

As a master on the PCI, the 21071-DA chip does not initiate locked 
transactions. See Section 11.2.3 for a detailed description of the 21071-DA 
response to locked transactions initiated by other master devices. 

9.2.8 PCI Bus Parking 

When no devices are requesting bus mastership, it is recommended that the 
system arbiter grant default bus ownership to the 21071-DA chip by asserting 
its GntL signal. This will reduce the latency for CPU -initiated transfers to the 
PCI when the bus is idle. Granting the PCI to a device when no requests are 
pending is referred to as parking in the PCI Local Bus Specification. If the 
21071-DA chip is granted the bus when it is not requesting the PCI, it will 
drive the AD<31:0>, CBE_I<3:0>, and PAR signals. 

The 21071-DA chip also supports PCI bus parking during reset. If the GntL 
signal is asserted by the PCI arbiter (ReqL is always tristated by the 21071-DA 
chip during reset), the 21071-DA chip will drive AD<31:0>, BE<3:0>#, and (one 
clock cycle later) PAR. When GntL is deasserted, the 21071-DA chip tristates 
these signals. 

9.2.9 PCI Retry Timeout 

The 21071-DA chip implements a timeout mechanism to terminate CPU- 
initiated transactions that do not complete on the PCI because of too many 
disconnects or retries. When it initiates a CPU transaction on the PCI, the 
21071-DA chip counts the number of times it gets retried or disconnected, 
and if the number exceeds 2^^ it flags an error to the CPU and aborts the 
transaction. The 21071-DA chip considers losing GntL during address stepping 
(for configuration cycles) as a disconnect for the purposes of this timeout 
mechanism. 

9.2.10 PCI Master Timeout 

The PCI protocol specifies a mechanism to limit the duration of a master's 
burst sequence. The mechanism requires a PCI master to implement a latency 
timer that counts the number of cycles since the assertion of FRAME*. If the 
master latency timer has expired and the master's grant has been taken away, 
the master is required to surrender the bus. This mechanism is intended to 
prevent masters from holding bus ownership for extended periods of time. 
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trading off high throughput for low latency. The 21071-DA implements a 
programmable master latency timer. 

9.2.11 Address Stepping in Configuration Cycles 

The 21071-DA chip does not have dedicated IDSEL#pins for use in PCI 
configuration cycles. Because AD<31:11> are not used during configuration 
cycles, they are connected to the IDSEL#pins of the various PCI devices. 
These devices can then uniquely be selected during configuration cycles by 
using addresses which assert only one bit of AD<31:11> at a time. By doing 
this, an added load is presented to those address lines that are connected 
to the IDSEL#pins of PCI devices. This load can be reduced by resistively 
coupling the line to the pin; however, the time for the signal to become valid at 
thelDSEL#pin is then increased. 

In order to provide flexibility and reduce design complexity when using 
resistive coupling to IDSEL#pins, the 21071-DA chip performs address 
stepping on configuration reads and write transactions. For these transactions, 
the 21071-DA chip will drive the PCI bus for two clock cycles during the 
address phase in order for the IDSEL#pins of all the PCI devices to reach a 
valid logic level. 

The 21071-DA chip does not perform address or data stepping in any other 
case. 

9.3 Transactions 

This section describes the transactions performed by the 21071-DA chip. 

9.3.1 sysBus Transactions 

The 21071-DA chip is a master and a slave on the sysBus. When it is a master, 
it performs DMA transactions on the sysBus. When it is a slave, it responds to 
I/O transactions initiated by the CPU. 

9.3.1.1 CPU-Initiated Transactions 

When the CPU is master of the sysBus, the sysBus interface monitors the 
commands and addresses sent out by the CPU. If the addresses are within 
the 21071-DA chip address range, and the command is valid, the 21071-DA 
chip responds to the transaction. The 21071-DA chip does not acknowledge the 
CPU directly. Acknowledgments are communicated to the CPU through the 
21071-CA chip, and data is communicated through the 21071-BA chips. 

The following transactions are supported by the 21071-DA sysBus interface: 

• Read Block to Remote (PCI ) Space 
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The 21071-DA chip responds to the transaction by notifying the 21071-CA 
chip when data is ready in the I/O read data buffer. The 21071-DA chip 
may choose to preempt this transaction if a DIM A read transaction is in 
progress on the PCI, or if the DMA write buffer is full, and the DMA 
transaction needs to get on to the sysBus (deadlocl< resolution). 

The 21071-DA chip supports longword or quadword reads in PCI space. 
Read Block to Local (CSR) Space 

This is treated similarly to the read blocl< to remote space. The only 
difference is in the conditions for preemption. This transaction is 
preempted only if it is stalled on the sysBus and queued behind an I/O 
write transaction that cannot be completed until a DMA transaction on the 
PCI can access the sysBus (deadlock resolution). 

The 21071-DA chip supports only longword reads aligned on cache line 
boundaries in CSR space. 

Write Block to Remote or Local Space 

The 21071-DA chip acknowledges the transaction when all previous I/O 
writes have been completed on the PCI . This transaction is preempted only 
if it is stalled on the sysBus and queued behind an I/O write transaction 
that cannot be completed until a DMA transaction on the PCI can get onto 
the sysBus (deadlock resolution). 

The 21071-DA chip supports only longword writes in CSR space, up to 
quadword writes in sparse PCI space, and up to 8 longword writes in dense 
PCI memory space. 

LDx_L to I/O Space 

This is treated just like a read block to I/O space. 

STxC to I/O Space 

This is treated just like a write block to I/O space. 
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• Barrier 

The 21071-DA chip uses the barrier command to ensure synchronization 
between the CPU and DIM A devices on the PCI. It does not acl<nowledge 
the command until the I/O write buffer has been flushed, and any writes 
that were present in the DMA write buffer when the barrier command was 
received have been flushed to memory. 

• Fetch, FetchM to 21071-DA Space 

The 21071-DA chip does not do anything special on a fetch, fetchM 
transaction to its address space. It sends an acl<nowledgment to the 
21071-CA chip as soon as it sees the command on the bus. 

9.3.1.2 PCI-lnitiated Transactions 

Transactions from PCI devices to main memory cause the 21071-DA chip to 
arbitrate for the sysBus and perform DMA transactions to memory. 

DMA transactions on the sysBus always start with an arbitration cycle where 
the 21071-DA chip asserts oneof thetiiree possible request codes to the arbiter. 
When the grant is received, the address is driven on to the sysAdr bus, which 
is common to the CPU, the Bcache, the 21071-CA, and the 21071-DA chips. 
Data is transferred between the 21071-BA and the 21071-DA chips using the 
epIBus prior to the start of a DMA write or as data is returned on DMA reads. 

The sysBus interface uses the atomic request when it has to prefetch read 
data or when it needs to perform a scatter/gather lool<up. It does at most two 
memory read transactions during such a request. It uses the preempt request, 
when it has to suspend the current CPU transaction, which is targeted towards 
it, in order to let a DMA transaction complete (Section 9.4.2). At all other 
times, it uses the normal DMA request. 

The following DMA transactions are performed by the 21071-DA chip on the 
sysBus: 

• PCI DMA Read 

On a PCI DMA read transaction, the 21071-DA chip could use one of four 
sysBus DMA read commands: 

- DMA read 

- DMA read wrapped 

- DMA read burst 

- DMA read burst wrapped 
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The wrapped qualifier is used to indicate whether the lower octaword of 
data from the cache line is requested or the upper octaword is requested. 
The wrapped command is used when the upper octaword is requested. 

The burst qualifier is a hint to the memory controller that the following 
read transaction is likely to be in the same page. The burst command is 
used when the 21071-DA chip is likely to prefetch data from memory. The 
maximum data that can be prefetched is a cache line; a memory read on 
the PCI can be at most 16 longwords (Section 9.2.4). The 21071-DA chip 
uses DMA read burst on the first cache line read indicating that it is going 
to follow it up with another read. The second read uses the DMA read 
command because that cache line is the end of the burst. 

Scatter/Gather Read 

21071-DA chip performs a DMA read or DMA read wrapped transaction on 
the sysBus when it needs to read the scatter/gather map. 

PCI DMA Write 

A PCI DMA write transaction causes a DMA write full or a DMA write 
masked command on thesysBus. A DMA write full command is used when 
the whole cache line will be written to memory, and a DMA write masked 
command is used when the cache line has to be written partially. The byte 
masks for the data are transferred to the 21071-BA chip along with the 
data. 



9.3.2 PCI Transactions 

The 21071-DA chip supports the following transactions on the PCI: 

• Interrupt acknowledge: PCI master. 

• Special cycle: PCI master. 

• I/O read: PCI master. 

• I/O write: PCI master. 

Memory read: A slave, when the transaction is initiated by another PCI 
device accessing system memory. A master, when the CPU is accessing an 
address in PCI memory space. 

Memory write: A slave, when the transaction is initiated by another PCI 
device accessing system memory. A master, when the CPU is accessing an 
address in PCI memory space. 



Configuration read: PCI master. 
Configuration write: PCI master. 
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• Memory write and invalidate: PCI slave; treated just like a memory write. 

• Memory read line: PCI slave; treated just like a memory read. 

• Memory read multiple: PCI slave; cache line read prefetch is performed 
irrespective of the state of the prefetch enable bit. 

• Dual address cycles: Ignored. 

9.4 Miscellaneous Architectural Issues 

This section describes the miscellaneous architectural issues, including: 

• Data coherency 

• Deadlock resolution 

• Guaranteed access time mode support 

9.4.1 Data Coherency 

There are generally two agents in the system whose data transfer actions need 
to be synchronized: 

• TheCPU 

• A remote PCI device 

The 21071-DA chip maintains data coherency and synchronization between 
these two agents using the following mechanisms: 

• The 21071-DA chip preserves strict ordering of DMA writes initiated on the 
PCI. 

• DMA reads can bypass writes that are not to the same address (double 
cache line). Strict ordering is maintained between reads and writes to the 
same address. 

• I/O transfers from the CPU to the PCI or to 21071-DA CSRs are performed 
in order. This policy guarantees a coherent view of PCI I/O space from the 
CPU viewpoint. 

• The 21071-DA chip flushes DMA write data to memory prior to 
acknowledging a barrier command from the CPU. Because explicit ordering 
commands are absent on the PCI, the software MB instruction is used to 
order CPU and DMA accesses. 
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• The 21071-DA chip also flushes the I/O write buffer to the PCI before 
acknowledging a barrier command. This preserves the order between CPU 
I/O accesses and CPU memory accesses. 

• The 21071-DA chip clears the system lock flag on PCI exclusive reads and 
writes to system memory. 

9.4.2 Deadlock Resolution 

I n a 21071 or 21072 system, two major buses are allocated for use during data 
transfers— the sysB us and the PCI. Some data transfers require the use of both 
of these buses to complete. I n particular, CPU I/O transfers to or from the PCI 
require ownership of the sysBus followed by ownership of the PCI. Similarly, 
PCI DMA transfers to or from the memory subsystem require ownership of the 
PCI followed by ownership of the sysBus. 

Because of the non-pended nature of these buses, during read transfers (I/O or 
DMA), both buses must be held at the same time for the transfer to complete. 
Generally during write transfers (I/O or DMA), because the 21071-DA chip 
features write-and-run style buffering, only one bus must be held at a time. 
However, when a write buffer is full, both buses must be held at the same 
time so that some data from the write buffer can be flushed before new data is 
accepted. 

For any transfer requiring the use of both buses, the 21071-DA chip is 
responsible for acquiring the second level bus on behalf of the initiator. 
Deadlocks can occur when the CPU and a remote PCI agent have both 
initiated transfers requiring the use of both the sysBus and the PCI bus. If the 
CPU has already acquired the sysBus, and the PCI agent has already acquired 
the PCI bus, the 21071-DA chi p would be unable to complete either transaction 
without resolving the deadlock. 

The 21071-DA chip resolves deadlock by forcing the CPU to relinquish 
ownership of the sysBus thereby giving priority to the PCI agent. By giving 
priority to the PCI agent, the 21071-DA chip gives the system designer more 
flexibility when choosing PCI devices. In particular, this flexibility allows 
designers to choose devices that resort to using PCI disconnect when handling 
deadlock situations that arise at their end. The 21071-DA chip forces the CPU 
to relinquish the sysBus by using a preempt request while arbitrating for the 
sysBus. 
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9.4.3 Guaranteed Access Time Mode Support for Intel 82375EB and 
82378IB ISA/EISA Bridges 

The Intel 82375EB and 823781 B EISA/ISA bridges (EIB) provide three 
sideband signals to provide mechanisms for flushing system write buffers and 
to allow a guaranteed access time of 2.1 fis to a master on the ISA/EISA bus. 
The three signals are: 

• FLUSHREQ* 

• MEMREQ* 

• MEMACK# 

The first two are outputs from the EIB, and the last one is an input to the EIB. 

• The EIB asserts MEMREQ#and FLUSHREQ#when it requires 
guaranteed access from memory. It expects the host bridge to assert 
MEMACK#when it has cleared the path to memory. This is accomplished 
by flushing any posted writes and disabling the posting of any further 
writes, thereby guaranteeing an access time of 2.1 fis on the bus. 

• The EIB keeps MEM REQ#deasserted and asserts FLUSH REQ#when it 
requires that posted writes from the CPU to the PCI and from the PCI to 
the CPU be flushed to prevent deadlocks between a DMA request from an 

I SA master and an I SA bus access from the host bridge. I n this case too, it 
expects to see M EM AC K# asserted when the appropriate buffers have been 
flushed. 

The 21071-DA chip provides its own mechanism for deadlock prevention, 
by preempting CPU transactions to allow DMA transactions to complete. It 
therefore does not need to support the deadlock prevention mechanism of the 
EIB, and it does not implement the FLUSHREQ* protocol. 
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Note 

Because the 21071-DA chip does not implement FLUSH REQ#, external 
logic must force the assertion of MEMACK#tothe EIB upon the 
assertion of FLUSHREQ*. The equation for MEMACK#going to the 
EIB should be as follows: 

EIB_MEMACK#=NOT ( (MEMREQ#AND (NOT FLUSHREQi^ ) OR 
(NOT DA_MEMACK#)) 

To meet MEMACK#timing requirements, the output of this equation 
may need to be clocl<ed through a flip-flop. The equation and output 
flip-flop could both be implemented in a single PAL. 



9.4.3.1 DECchip 21071-DA GAT Mode Operation 

The 21071-DA chip takes the following action when it sees MemReql asserted: 

1. The 21071-DA chip turns down its DMA write-and-run buffering capacity 
to a single burst of up to eight longwords. Any PCI write transaction 
directed toward the 21071-DA chip will be retried by the 21071-DA chip 
unless its DMA write buffer is empty. Read-bypass-write flows remain 
enabled. 

2. The 21071-DA chip requests the sysBus (ioRequest = regular or preempt). 

3. Once granted the sysBus, the 21071-DA chip holds the sysBus grant 
(ioRequest = atomic) until MemReql is deasserted. 

4. With the sysBus grant, the 21071-DA chip flushes all DMA write buffers 
(if non-empty) and then performs a flush transaction (ioCmd = Flush) to 
ensure that posted writes in the 21071-CA chip have completed. At the end 
of the flush transaction, the 21071-DA chip asserts MemAcl<l. 

5. While MemReql continues to be asserted, the 21071-DA chip will continue 
to service PCI transactions with write-and-run buffering capacity set to a 
single hexaword. The 21071-DA chip performs a flush transaction on the 
sysBus atomically following each DMA write transaction. Read-bypass- 
write flows will not be exercised at this point because the 21071-DA chip 
is holding the sysBus grant. (DMA writes will always start on the sysBus 
before a DMA read is far enough along on the PCI to bypass the DMA 
write.) 

6. Upon deassertion of MemReql, the 21071-DA chip deasserts MemAckl, 
returns DMA write-and-run buffering capacity to four hexawords, releases 
the sysBus grant (ioRequest = regular, preempt, or idle), and no longer 
performs flush transactions following DMA writes. 
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9.4.3.2 GAT Mode System Requirements 

Although the 21071 and 21072 chipsets provide the functionality described 
here In order to help guarantee access time, the system designer must ensure 
that worst case latencies are not excessive. The following system parameters 
must be considered: 

DRAM width 

DRAM access time 

sysClk speed 

Scatter/gather mapping of GAT mode devices 

Scatter/gather mapping of all other PCI devices 

Analysis shows that 30 ns (PCI cycle time) systems with any one of the 
following characteristics will be able to meet GAT mode latency requirements: 

• Systems with 128-bit wide memory interfaces. 

• Systems that do not scatter/gather map GAT mode devices. 

• Systems that do not scatter/gather map any other PCI devices. 

• Systems with 64-bit wide memory interfaces that utilize 60 ns DRAMs 
with appropriately programmed memory timing (assuming that either the 
refresh time does not exceed 180 ns, or the video support feature of the 
21071-CA chip is not utilized). 

• Systems that implement PCI arbiters that do not allow third-party 
scatter/gather-mapped writes to sneak in ahead of the GAT mode read. 

Systems that do not conform to any of the above specifications (including 
all systems with 40 ns PCI cycle times) will require further analysis to 
determine if GAT mode latency requirements can be met. The following 
sequence illustrates the worst case scenario and should be used as a guideline 
for further analysis. 

1. I n the time between the 21071-DA chip's assertion of MemAckl and when 
the EISA/ISA bridge (EIB) acquires the PCI bus to perform the GAT 
mode read, a third-party PCI device acquires the PCI bus and performs a 
scatter/gather-mapped partial write to main memory. 

2. The scatter/gather-mapped write is posted to the 21071 (or 21072 ) chipset, 
misses in the TLB, and therefore results in a scatter/gather read from 
memory followed by a masked write to memory. 
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3. As scxjn as the scatter/gather-mapped write completes on the PCI bus, the 
EIB can acquire the PCI bus, grant the EISA/ISA bus to the GAT mode 
device, and start the GAT mode read. Our analysis assumes that the 
GAT interval begins four PCI cycles (one EISA/ISA cycle) after the PCI 
bus is detected as idle following the third-party scatter/gather-mapped 
write. The GAT mode read will never bypass the third-party DMA 
write inside the 21071-DA because the DMA write starts on the sysBus 
immediately after it completes on the PCI (sysBus is held atomically after 
MemAckl is asserted). The GAT mode read will start on the PCI while the 
scatter/gather read for the third-party DMA write is in progress on the 
sysBus. 

4. If the read from the GAT mode device is also scatter/gather mapped 
(and also misses in the TLB), the total latency from the start of the GAT 
interval to the return of requested read data will include the time required 
to perform: 

1. Part of the scatter/gather read for third-party masked DMA write 

2. The third-party masked DMA write 

3. The scatter/gather read for the GAT mode DMA read 

4. The GAT mode DMA read 

Our analysis assumes that the GAT interval ends eight PCI cycles (two 
EISA/ISA cycles) after the first requested longword is transferred on the 
PCI bus. 

The total latency of these four transactions, plus the time required for one 
memory refresh transaction, plus the time for a 21071-CA video transaction (if 
this feature is in use), plus the eight tail-end cycles, must not exceed the GAT 
latency limit of 2.1 |js. 

Note that a major portion of the latency can be eliminated if one of the 
following occurs: 

• The GAT mode device, all other PCI devices, or both, are not scatter/gather 
mapped. 

• The PCI arbiter prevents third-party writes from sneaking in before the 
GAT mode read. 

The previous measures allow more systems, including certain 40 ns systems, 
to meet the GAT mode latency requirements; however, system designers must 
analyze their implementations in order to accurately estimate worst-case 
latencies. 
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9.5 Interrupts 

The 21071-DA chip interrupts the CPU using the intHwO signal when it 
has errors to report. The 21071-DA chip does not distinguish between hard 
and soft errors when asserting this interrupt signal. However, the software 
can mask the assertion of the interrupt signal on soft (correctable) errors by 
disabling error correction reporting using thedCEl bit in the DCSR register. 

The 21071-DA chip does not provide an interval timer interrupt. This 
functionality is expected to be provided to the CPU by some other device in the 
system. In addition, interrupts from other PCI devices or from a PCI interrupt 
controller must be sent directly to the CPU. 

The 21071-DA chip participates in the interrupt acl<nowledge process by 
responding to CPU read blocl< commands directly to the interrupt acl<nowledge 
address space, which triggers the 21071-DA chip to perform an Interrupt 
Acl<nowl edge transaction on the PCI. The interrupt vector returned on the PCI 
is returned to the CPU through the sysBus by the 21071-DA chip. 

9.6 Error Handling 

This section describes how errors are handled by the 21071-DA chip. The 
following descriptions assume that the 21071-DA error registers are not 
already locked by a previous error condition. If the 21071-DA errors registers 
are locked by an earlier error, then additional errors merely set the lost error 
bit, and, if appropriate, cause the 21071-DA chip to assert intHwO. 

I ntH wO is kept asserted as long as the corresponding error bit is set. 

The PCI error address register (PEAR) logs addresses sent out or received on 
the PCI . The sysBus error address register (SEAR) logs addresses sent out or 
received on the sysBus. The error logging CSRs are described in further detail 
in Chapter 10. 

9.6.1 CPU-Initiated Transactions 

The 21071-DA chip always returns HARD_ERROR on ioCmd<2:0> field on 
I/O read transactions that have errors. No interrupt is asserted in this case, 
because the microprocessor has been notified that the read had an error. 
In no situation does the 21071-DA chip assert SOFT_ERROR on I/O read 
transactions because the microprocessor would interpret it as a failure that 
occurred during the transaction, but was corrected. 
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I/O writes are always acknowledged with OK (100 on cpuCAck<2:0>). 
Because of the write-and-run feature for I/O writes in the 21071-DA chip, 
the transaction is always acknowledged on thesysBus before it is initiated on 
the PCI. An interrupt (intHwO) will assert to notify the microprocessor if an 
error occurs on the PCI during the I/O write. 

The actions taken on the various errors that can occur on CPU-initiated 
transactions are described in the following sections. 

9.6.1.1 No Device Error 

On an I/O transaction initiated by the 21071-DA chip, if DEVSEL#is not 
asserted within 5 cycles, the 21071-DA chip assumes that no PCI device is 
going to respond to this transaction. The following action is taken: 

The 21071-DA chip terminates the PCI transaction using the master-abort 
protocol . 

The nDev bit is set in the DCSR. The pci_Cmd field is set to the 
appropriate value depending upon the transaction. 

The PCI error address register (PEAR) contains the address sent out at the 
beginning of the PCI transaction and is locked. 

On writes, intHwO signal is asserted to interrupt the processor. 

On reads, the 21071-DA chip forces the value 101 (cpuCAck HARD_ERROR) 
on ioCmd<2:0>to end the sysBus transaction. 

To clear the error, a logic 1 must be written to the nDev bit in the DCSR. 
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9.6.1.2 Target Abort Errors 

On an I/O transaction initiated by tine 21071-DA chip, if the target device 
terminates the PCI transaction using the target-abort protocol, the following 
action is taken: 

The 21071-DA chip, as master, terminates the PCI transaction in 
accordance with the target-abort protocol. 

The tAbt bit is set in the DCSR, and the pci_Cmd field is set to the 
appropriate value depending upon the transaction. 

The PCI error address register (PEAR) contains the address sent out at the 
beginning of the PCI transaction and is locl<ed. 

On writes, intHwO signal is asserted to interrupt the processor. 

On reads, the 21071-DA chip forces the value 101 (cpuCAck HARD_ERROR) 
on ioCmd<2:0> to end the sysBus transaction. 

To clear the error, a logic 1 must be written to the tAbt bit in the DCSR. 

9.6.1.3 Address Parity Errors 

On any I/O transaction there is no way for the 21071-DA chip to determine 
that a parity error occurred in the address phase of the transaction because 
the 21071-DA chip does not have a SERR#pin. (PERR#is not used to convey 
address parity error information.) As a result, the 21071-DA chip can take no 
action. 

9.6.1.4 Read Data Parity Errors 

On an I/O read transaction initiated by the 21071-DA chip, if the parity 
generated off the incoming data sampled from the PCI AD lines (data), and the 
byte enables driven by the 21071-DA chip are different from the value sampled 
from PAR, a read data parity error condition has occurred. The following action 
is taken: 

• The transaction continues normally. 

• The 21071-DA chip asserts PERR#on the PCI. 

• The ioPE bit is set in the DCSR, and the pci_Cmd field is set to the 
appropriate value depending upon the transaction. 

• The PCI error address register (PEAR) contains the address sent out at the 
beginning of the PCI transaction and is locked. (Note: If an error occurs on 
both longwords of a quadword transaction, then the lost bit will be set.) 
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• The 21071-DA chip forces the value 101 (cpuCAck HARD_ERROR) on 
ioCmcl<2:0>to end thesysBus transaction. 

• To clear the error, a logic 1 must be written to ioPE bit in the DCSR. 

9.6.1.5 Write Data Parity Errors 

On an I/O write transaction initiated by the 21071-DA chip, if PERR#is 
asserted by the slave device for any longword of data, a write data parity error 
condition has occurred. The following action is taken: 

• The transaction completes normally on the PCI . 

• The ioPE bit is set in the DCSR, and the pci_Cmd field is set to the 
appropriate value depending upon the transaction. 

• The PCI error address register (PEAR) contains the address sent out at the 
beginning of the PCI transaction and is locked. (Note: If an error occurs on 
more than 1 longword of a single write burst the lost bit will be set.) 

• IntHwO signal is asserted to interrupt the processor. 

• To clear the error, a logic 1 must be written to the IoPE bit in the DCSR. 

9.6.1.6 Retry Timeout 

On an I/O transaction initiated by the 21071-DA chip, if the retry timeout 
counter overflows (this happens when the 21071-DA chip has been retried 2^'* 
times by the target), the 21071-DA chip does the following: 

The 21071-DA chip does not retry the transaction on the PCI again. 

The ioRT bit is set in the DCSR and the pci_Cmd field is set to the 
appropriate value depending upon the transaction. 

The PCI error address register (PEAR) contains the address sent out at the 
beginning of the PCI transaction and is locked. 

IntHwO signal is asserted to interrupt the processor. 

On reads, the 21071-DA chip forces the value 101 (cpuCAck HARD_ERROR) 
on ioCmd<2:0>to end the sysBus transaction. 

To clear the error, a logic 1 must be written to the ioRT bit in the DCSR. 

9.6.2 DMA Transactions 

All DMA transaction errors will be flagged by interrupting the processor 
(IntHwO asserted) when the error occurs, except where noted. 
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9.6.2.1 Address Parity Errors 

On any DMA (PCI -initiated) transaction address phase, if the generated parity 
of the incoming address and command sampled from the PCI AD and C/BE# 
lines is different from the value sampled from PAR, an address parity error 
condition has occurred for that transaction. The following action is taken: 

The 21071-DA chip does not respond to the transaction. Due to the parity 
error, the 21071-DA chip is not certain if the command was correct (read or 
write) and is not sure what the intended address for that transaction was. 
(PERR#is not asserted because it is only intended for data parity errors on the 
PCI.) intHwO is not asserted. 

9.6.2.2 Read Data Parity Errors 

On a DMA read transaction data phase, if there is a parity error it might be 
detected by the PCI master device. Even if this device asserts PERR#and the 
21071-DA chip takes no action, it is the PCI master's responsibility to handle 
the error condition. intHwO is not asserted. 

9.6.2.3 Write Data Parity Errors 

On any DMA write transaction data phase, if the generated parity of the 
incoming data and byte enables sampled from the PCI AD and C/BE#lines is 
different from the value sampled from PAR, a data parity error condition has 
occurred. The following action is taken: 

The 21071-DA chip asserts PERR#pin for one cycle on the PCI two cycles 
after the data was transferred on the bus, to indicate the condition. 

The dDPE bit is set in the DCSR. 

The PCI error address register (PEAR) contains the address that came off 
the PCI bus at the beginning of the transaction and is locked. 

IntHwO signal is asserted to interrupt the processor. 

The write will continue normally on the PCI. 

That particular cache line entry will not be written to memory. 

To clear the error, a logic 1 must be written to the dDPE bit in the DCSR. 
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9.6.2.4 Memory Errors 

On a DMA transaction, if the 21071-CA cinip detects an error (non-existent 
memory address, tag address parity error, or tag control parity error), it will 
log the address and tine specific error bit, and terminate the sysBus transaction 
by driving 11 (DMA cycle error) on ioCAck<I:0>. The following action is tal<en 
if data was going to be transferred on the PCI . 

Note 



Prefetched cache line data may not be required by the PCI device. If 
there is a tag address parity error or tag control parity error on an 
unused prefetched cache line, the error will be logged by the 21071-CA 
chip, but the CPU will not be interrupted by the 21071-DA chip. 



The mErr bit is set. 

The sysBus error address register (SEAR) contains the address that caused 
the memory error and is locked. 

intHwO signal is asserted to interrupt the processor. 

On reads, the 21071-DA chip terminates the PCI transaction using the 
target-abort protocol. 

On writes, the 21071-DA chip dismisses the write buffer entry (single cache 
line). Note that if a single PCI write burst crossed a cache line boundary 
and therefore filled two write buffer entries (two cache lines), each entry is 
handled separately on the sysBus. 

• To clear the error, a logic 1 must be written to the mErr bit in the DCSR. 

9.6.2.5 Read Correctable Data Error 

On a DMA read transaction, if there is a correctable error in memory or Bcache 
and the 21071-BA chips are configured in ECC mode, the 21071-BA chips will 
correct the longword with the single-bit error before sending it to the 21071-DA 
chip. 

If and when this longword is sent to the 21071-DA chip, along with the data on 
theeplData bus, epiBEnErr<l> will contain information whether or not this 
longword had a correctable error. The following action is taken if the longword 
was going to be transferred on the PCI . 
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If the Disable Correctable Error Interrupt bit (dCEl) is set, the information 
on epi BE nErr<l> is ignored. As a result, intHwO will not be asserted, the 
cMRD bit will not beset, and the error address will not be logged in SEAR. 

intHwO signal is asserted to interrupt the microprocessor. 

No error occurs on the PCI and the transaction completes normally. 

ThecMRD bit is set in the DCSR. 

The sysBus error address register (SEAR) contains the address that caused 
the correctable error and is locked. 

To clear the error, a logic 1 must be written to the cMRD bit in the DCSR. 

9.6.2.6 Read Uncorrectable Data Error 

On a DMA read transaction, if there is an uncorrectable error (parity error or 
double-bit ECC error) in memory or Bcache, the 21071-BA chips will inform 
the 21071-DA chip when they send the bad data over the epi Data bus. 

If and when this longword is sent to the 21071-DA chip, along with the data on 
theepiData bus, epiBEnErr<0>will contain information whether this longword 
had an uncorrectable error or not. The following action is taken if the longword 
was going to be transferred on the PCI . (Note: I n some cases, not all longwords 
of a cache line will be transferred.) 

• IntHwO is asserted to interrupt the microprocessor. 

• TheuMRD bit is set. 

• The sysBus error address register (SEAR) contains the address that caused 
the uncorrectable error and is locked. 

• The 21071-DA chip terminates the PCI transaction using the target-abort 
mechanism. 

• To clear the error, a logic 1 must be written to the uMRD bit in the DCSR. 

9.6.2.7 Scatter/Gather Entry Invalid Errors 

On scatter/gather mapped DMA transactions, the scatter/gather entry being 
accessed might be invalid. The actual write to or read from memory will not 
occur. The following action is taken: 

• iPTL bit is set in the DCSR. 

• The PCI error address register (PEAR) contains the address that caused 
the error and is locked. 

• IntHwO signal is asserted to interrupt the processor. 
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• If the scatter/gather read was for a DMA read, the 21071-DA terminates 
the PCI transaction using the target abort protocol. 

• If the scatter/gather read was for a DMA write, the 21071-DA dismisses 
the write buffer entry (single cache line). Note that if a single PCI write 
burst crossed a cache line boundary and therefore filled two write buffer 
entries (two cache lines), each entry is handled separately on thesysBus. 

• To clear the error, a logic 1 must be written to the iPTL bit in the DCSR. 

9.6.2.8 Write Correctable and Uncorrectable Data Errors 

If a DMA write is not a full hexaword, the 21071-CA chip performs a read- 
modify-write. If an error is detected on the read from memory before the write 
is done, the 21071-DA chip does not perform any action. See Section 16.2.2 
for details about how the 21071-BA chip handles data errors on DMA write 
transactions. 

9.6.2.9 Scatter/Gather Correctable Data Error 

On a scatter/gather read transaction, if there is a correctable error in memory 
or Bcache and the 21071-BA chips are configured in ECC mode, the 21071-BA 
chips will correct the longword with the single-bit error before sending it to the 
21071-DA chip. When this longword is sent to the 21071-DA chip, along with 
the data on theepiData bus, epiBEnErr<l> will contain information whether 
or not this longword had a correctable error. The following action is taken: 

• If the Disable Correctable Error Interrupt bit (dCEl) is set, the information 
on epiBEnErr<l> is ignored. As a result, intHwO will not be asserted, the 
cMRD bit will not beset, and the error address will not be logged in SEAR. 

intHwO signal is asserted to interrupt the microprocessor. 

No error occurs on the PCI and the transaction completes normally. 

ThecMRD bit is set in the DCSR. 

The sysBus error address register (SEAR) contains the address that caused 
the correctable error and is locked. 

To clear the error, a logic 1 must be written to the cMRD bit in the DCSR. 
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9.6.2.10 Scatter/Gather Uncorrectable Data Error 

On a scatter/gather read transaction, if there is an uncorrectable error (parity 
error or double-bit ECC error) in memory or Bcaclne, tine 21071-BA chips will 
inform the 21071-DA chip when it sends this data over the epiData bus. 

When this longword is sent to the 21071-DA chip, along with the data on 
the epiData bus, epiBEnErr<0> will contain information whether or not this 
longword had an uncorrectable error. The following action istal<en: 

• intHwO is asserted to interrupt the microprocessor. 

• TheuMRD bit is set. 

• The sysBus error address register (SEAR) contains the address that caused 
the uncorrectable error and is locl<ed. 

• If the scatter/gather read was for a DMA read, the 21071-DA terminates 
the PCI transaction using the target abort protocol. 

• If the scatter/gather read was for a DMA write, the 21071-DA dismisses 
the write buffer entry (single cache line). Note that if a single PCI write 
burst crossed a cache line boundary and therefore filled two write buffer 
entries (two cache lines), each entry is handled separately on the sysBus. 

• To clear the error, a logic 1 must be written to the mErr bit in the DCSR. 

9.6.2.11 Scatter/Gather Memory Errors 

On a DMA transaction, if the 21071-CA detects an error (non-existent memory 
address, tag address parity error, or tag control parity error) during a 
scatter/gather read transaction, it will terminate the sysBus transaction. 
The 21071-CA will log the address and the specific error bit, and terminate 
the sysBus transaction by driving 10 (DMA cycle error) on ioCAcl<<l:0>. The 
following actions are taken by the 21071-DA: 

• The mErr bit is set in the DCSR. 

• The sysBus Error Address Register (SEAR) contains the address that 
caused the memory error and is locl<ed. 

• IntHwO signal is asserted to interrupt the processor. 

• If the scatter/gather read was for a DMA read, the 21071-DA terminates 
the PCI transaction using the target abort protocol. 



9-28 DECchip 21071-DA Architecture Overview 



If the scatter/gather read was for a DMA write, the 21071-DA dismisses 
the write buffer entry (single cache line). Note that if a single PCI write 
burst crossed a cache line boundary and therefore filled two write buffer 
entries (two cache lines), each entry is handled separately on the sysBus. 

To clear the error, a logic 1 must be written to the mErr bit in the DCSR. 
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10 



DECchip 21 071 -DA Programmer's 

Reference 



This chapter provides information about DECchip 21071-DA address 
translation. It also describes the DECchip 21071-DA internal registers. 

10.1 Address Translation 

This section describes the mapping of the 34-bit processor physical address 
space to 32-bit PCI address space, and the translation of the 32-bit PCI 
addresses to 34-bit physical memory space. 

Note 



The slave machine of the DECchip 21071-DA PCI interface will not 
respond to a CPU-initiated address that has been driven onto the PCI 
by the master machine of the PCI interface, even if the address hits the 
programmed PCI DMA window. That is, the DECchip 21071-DA chip 
does not support loopbacl< mode on the PCI . 



10.1.1 CPU Address Mapping to PCI Space 

The 34-bit physical sysBus address space is divided to form: 

Memory address space 

• Local I/O space (local I/O space is used for CSRs in the 21071-CA and 
21071-DA chips) 
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• PCI space 

The PCI defines three physical address spaces: 

- PCI memory (for memory residing on the PCI ) 

- PCI I/O space 

- PCI configuration space 

In addition to these three address spaces on the PCI, the sysBus I/O space is 
also used to generate PCI interrupt acl<nowledge cycles and PCI special cycles. 
Table 10-1 shows the sysBus address mapping required to generate these 
address spaces. 



Table 10-1 sysBus Address Map 



sysAdr<33:32> sysAdr<31 :28> Address Space 



Notes 



00 



xxxx 



Cacheable memory 
space 



The 21071-DA chip does not 
respond to addresses in this 
space. 



01 



01 



01 



01 



oxxx 



lOOX 



1010 



1011 



Noncacheable memory 
space 

21071-CA CSRs 



21071-DA CSRs 



PCI interrupt acknowl- 
edge or PCI special 
cycle 



The 21071-DA chip does not 
respond to addresses in this 
space. 

The 21071-DA chip does not 
respond to addresses in this 
space. 

The 21071-DA chip will 
respond to all addresses in 
this space. Dstream access 
only. 

A read causes a PCI 
interrupt acknowledge 
cycle; a write causes a 
special cycle. Dstream 
access only. 

(continued on next page) 
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Table 10-1 (Cont.) sysBus Address Map 



sysAdr<33:32> sysAdr<31 :28> Address Space 



Notes 



01 



llOX 



PCI sparse 1/0 space 



01 



lllX 



PCI configuration space 



16 JVjB of PCI space. Lower 
256 KB of tliis space must 
be used for addressing PCI, 
EISA, and ISA devices. 
Tlie rest of tlie space can 
be used for otiier devices. 
Dstream access only. 

Refer to Section 10.1.1.6 
for details. Dstream access 
only. 



10 



xxxx 



PCI sparse memory 
space 



11 



XXXX 



PCI dense memory space 



128 MB of PCI space 
addressable. The lower 
address bits are used to 
determine byte masks 
and transaction length 
information, hence the 4 
GB space is reduced to a 
128 M B sparse space. M ust 
use this space when byte 
or word access granularity 
is required. Read or write 
length is no more than a 
quadword. Reading more 
than the requested data is 
harmful. Prefetching read 
data is prohibited. Dstream 
access only. 

4 GB of PCI space. Used 
for devices with access 
granularity greater than 
a longword. Reads do 
not have side effects; 
prefetching of data from 
PCI devices is allowed. 
Typically used for data 
buffers. Dstream access 
only. 
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10.1.1.1 PCI Sparse Memory Space 

20000 0000.. 2FFFF FFFF 

Accesses to this space can have byte, word, tribyte, longword, or quadword 
granularity. The Alpha architecture does not provide byte, word, or tribyte 
granularity, which the PCI requires. Therefore to provide this granularity, the 
byte enable and byte length information are encoded in the lower address bits 
in this space. Address bits <7:3>are used for this purpose. Bits <31:8>are 
used to generate quadword addresses on the PCI, thus resulting in a sparse 4 
GB space that maps to 128 M B of address space on the PCI . An access to this 
space causes a memory read or memory write access on the PCI . 

The mapping is as follows: 

Address <33:32> are used to identify the various address spaces on the sysBus. 
Address <7:3> are used to generate the length of the PCI transaction in bytes, 
the byte enables, and address <2:0>. Refer to Table 10-2. Address <31:8> 
correspond to the quadword PCI addresses and are sent out on AD<26:3> 
during the address phase on the PCI. AD<31:27>are obtained from one of two 
host address extension registers— HAXRO and HAXRl. HAXRO (which is hard 
coded as 0) is used for sysBus addresses between 2 0000 0000 .. 2 1FFF FFFF, 
that is, when sysBus address <31:29> is 0. HAXRl is used to map sysBus 
addresses between 2 2000 0000 .. 2 FFFF FFFF, that is, when sysBus address 
<31:29> is non-zero anywhere in the PCI address space. HAXRl is a CSR 
in the 21071-DA chip and is fully programmable. This allows EISA/ISA 
devices that require memory to be mapped in the lower 16 MB to coexist 
with other devices that do not have that restriction. The lower 16 M B have a 
fixed mapping (HAXRO) to 0, and the remaining 112 MB can be programmed 
anywhere in PCI space. 

Figure 10-1 shows the sysBus to PCI memory address translation. Table 10-2 
shows the generation of the byte enables and the PCI address <2:0>from 
sysBus address <5:3>. 
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Table 10-2 


PCI Sparse 


Memory Space Byte Enable Generation 


Length 


CPU 

Address 

<6:5> 


CPU 

Address 

<4:3> 


PCI Byte 
Enable^ 


PCI Address<2:0>^ 


Byte 


00 


00 


1110 


CPU Address<7>, 00 




01 


00 


1101 


CPU Address<7>, 00 




10 


00 


1011 


CPU Address<7>, 00 




11 


00 


0111 


CPU Address<7>, 00 


Word 


00 


01 


1100 


CPU Address<7>, 00 




01 


01 


1001 


CPU Address<7>, 00 




10 


01 


0011 


CPU Address<7>, 00 




11 


01 


IllegaP 


— 


Tribyte 


00 


10 


1000 


CPU Address<7>, 00 




01 


10 


0001 


CPU Address<7>, 00 




10 


10 


IllegaP 


— 




11 


10 


IllegaP 


— 


Longword 


00 


11 


0000 


CPU Address<7>, 00 


Longword 


01 


11 


IllegaP 


— 


Longword 


10 


11 


IllegaP 


— 


Quadword 


11 


11 


0000 


000 



^Byte enable set to indicates tliat byte lane rarries meaningful data. 

2|n PCI sparse memory space, PCI address <1:0> are always 00. 

^These combinations are architecturally illegal. If there is an access with this combination 
of address<6:3>, then the 21071-DA will respond to the transactions, but the results are 
unpredictable. 
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Figure 10-1 PCI Memory Space Address Translation 
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HAXRO 
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(Refer to Table for Translation) 
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Address Translation for Lower 16 MB of PCI Memory Space 



33 32 31 30 29 28 



08 07 06 05 04 03 



Non-Zero 



HAXR1<31:27> 



Lengtfi in Bytes 
Byte Offset 



Longword Address 

(Refer to Table for Translation) 



31 


30 


29 


28 


27 


26 


03 


02 


01 


00 


1 1 













Address Translation for Remaining 1 1 2 MB of PCI Memory Space 



It is important to note tinat sysBus address<33:5> are directly aval I able from 
the Alpha 21064 microprocessor. sysBus address<4:3> have to be derived 
from the longword masks, cpuCWMasl«7:0>. On read transactions, the 
DECchip 21064 sends out address bits <4:3>on cpuCWMask<l:0>. On write 
transactions, the relationship between cpuCWMask<7:0>and address bits 
<4:3>is as follows: 
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If cpuCWMask<l:0>is non-zero, then address <4:3>is 00. 
If cpuCWMask<3:2>is non-zero, then address <4:3>is 01. 
If cpuCWMask<5:4>is non-zero, then address <4:3>is 10. 
If cpuCWMask<7:6>is non-zero, then address <4:3>is 11. 

Note 



Accesses in this space are no longer than a quadword. Software must 
ensure that the processor does not merge consecutive writes in its write 
buffers by using memory barriers after each write. 

Architecturally, if a byte, word, tribyte, or longword must be written on 
the PCI, an STL instruction must be done to the lower longword in the 
corresponding quadword address. An STQ or an STL instruction to the 
upper longword is not allowed. One bit-pair among cpuCWMask<l:0>, 
<3:2>, <5:4>, and <7:6> must have a value of 01 (binary); the other 
bit fields must all be 00 (binary). The location of the 01 (binary) field 
indicates whether the reference is byte, word, tribyte, or longword 
(respectively) in length. 

Similarly, if a quadword has to be written to the PCI, software must do 
an STQ instruction to the corresponding address; the only legal value 
on cpuCWMask<7:0>in sparse space is 11000000 (binary). 

If a byte, word, tribyte, or longword has to be read from the PCI, 
an LDL instruction must be done to the lower longword in the 
corresponding quadword address. An LDL instruction to the upper 
longword or LDQ instruction will return the wrong data. If a quadword 
has to be read from the PCI, software must use an LDQ instruction. 
An LDL instruction will return wrong data. 



10.1.1.2 PCI Dense Memory Space 

30000 0000.. 3FFFF FFFF 

PCI dense memory space is typically used for data buffers on the PCI and has 
the following characteristics: 

• There is a one-to-one mapping between CPU addresses and PCI addresses. 
A longword address from the CPU maps to a longword on the PCI . Hence 
the name dense space (as opposed to PCI sparse memory space). 

• Byte or word accesses are not allowed in this space. Minimum access 
granularity is a longword. The maximum transfer length implemented by 
the 21071 and 21072 chipsets is a cache line (32 bytes) on writes and a 
quadword on reads. 
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• Read prefetching is allowed in this space; extra reads have no side effects. 
The DECchip 21064 microprocessor does not specify a longword address on 
read transactions; it only specifies a quadword address. Therefore, reads in 
this space will always be done as a quadword read with a burst length of 
two on the PCI. 

• Writes to addresses in this space can be buffered in the DECchip 21064 
microprocessor. The 21071 and 21072 chipsets support a maximum burst 
length of 8 on the PCI corresponding to a cache line of data. 

The address generation in dense space is as follows: 

CPU address <31:5> is directly sent out on PCI address <31:5>. On read 
transactions, PCI address <4:3> is generated from cpuCWMask<l:0>, PCI 
address <2> is always 0. 

On write transactions, PCI address <4:2>is generated from cpuCWMasl«7:0>. 
If the lower longword is to be written, PCI address <2> is 0; if the lower 
longword is masked out and the upper longword is to be written, PCI address 
<2> is 1. The number of longwords written on the PCI is directly obtained 
from cpuCWMask<7:0>. Any combination of cpuCWMask<7:0> is allowed by 
the 21071 or 21072 chipsets. 

Note 



If the cache line written by the processor has holes, that is, if some of 
the longwords have been masked out, the corresponding transfer will 
still be performed on the PCI with disabled byte enables. Downstream 
bridges must be able to deal with completely disabled byte enables on 
the PCI during write transactions. 



10.1.1.3 PCI Sparse I/O Space 

ICOOOOOOO.. IDFFF FFFF 

The PCI sparse I/O space is sparse and has similar characteristics to the PCI 
sparse memory space. This 512 M B sysBus address space maps to 16 M B of 
PCI I/O address space. A read or write to this space causes a PCI I/O read or 
PCI I/O write command respectively. The address generation is as follows: 

Address <33:29> are used to identify the various address spaces on the sysBus. 
Address <7:3> are used to generate the length of the PCI transaction in bytes, 
the byte enables, and address <2:0>on the PCI (Table 10-3). Address <28:8> 
correspond to the quadword PCI addresses and are sent out on AD<23:3> 
during the address phase on the PCI. AD<31:24>are obtained from one of two 
host address extension registers— HAXRO and HAXR2. 
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HAXRO (which is hard coded as 0) is used for sysBus addresses between 
1 COOO 0000 .. 1 C07F FFFF, that is, when sysBus address <28:23> is 0. 
HAXR2 is used to map sysBus addresses between 1 C080 0000 .. 1 DFFF FFFF, 
that is, when sysBus address <28:23>is non-zero, anywhere in the PCI address 
space. HAXR2 is a C5R in the 21071-DA chip and is fully programmable. This 
allows EISA/ISA devices that require their I/O space to be in the lower 256 
KB, to coexist with other devices that do not have that restriction. The lower 
256 KB have a fixed mapping (HAXRO) to 0, and the remaining 64 MB - 
256 KB can be programmed anywhere in PCI space. Figure 10-2 shows the 
sysBus to PCI I/O address translation. Table 10-3 describes the generation of 
the byte enables, and the PCI address<2:0>from sysBus address <5:3>. 

Table 10-3 PCI Sparse I/O Space Byte Enable Generation 





CPU 


CPU 








Address 


Address 


PCI Byte 




Length 


<6:5> 


<4:3> 


Enable^ 


PCI Address <2:0> 


Byte 


00 


00 


1110 


CPU Address<7>, 00 




01 


00 


1101 


CPU Address<7>, 01 




10 


00 


1011 


CPU Address<7>, 10 




11 


00 


0111 


CPU Address<7>, 11 


Word 


00 


01 


1100 


CPU Address<7>, 00 




01 


01 


1001 


CPU Address<7>, 01 




10 


01 


0011 


CPU Address<7>, 10 




11 


01 


Illegal^ 


— 


Tribyte 


00 


10 


1000 


CPU Address<7>, 00 




01 


10 


0001 


CPU Address<7>, 01 




10 


10 


Illegal 


— 




11 


10 


Illegal^ 


— 


Longword 


00 


11 


0000 


CPU Addrpss<7>, 00 


Longword 


01 


11 


Illegal^ 


— 


Longword 


10 


11 


Illegal^ 


— 


Quadword 11 


11 


0000 


000 



^Byte enable set to indicates tliat byte lane carries meaningful data. 

^These combinations are architecturally illegal. If there is an access with this combination of 
address<5:3>, the 21071-DA will respond to the transactions, but the results are unpredictable. 
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Warning 

Quadword accesses to this PCI sparse I/O space will cause a two 
longword burst on the PCI. PCI devices cannot support bursting in I/O 
space. 



Figure 10-2 PCI I/O Space Address Translation 
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Address Translation for Lower 256 KB of PCI I/O Space 



33 32 31 


30 


29 


28 




23 


22 


08 


07 


06 05 


04 03 


1 1 


1 





III 
Non-Zero 

III 
























' 


^»- 


HAXR2<31:24> 

1 






1 


' 




Longword Address 
{Refer to Table for Tr 

r 



Length in Bytes 



03 02 01 00 



Address Translation for Remaining 16 MB of PCI I/O Space 



10.1.1.4 DECchip 21071-DA CSR Space 
lAOOOOOOO.. lAFFF FFFF 

All the 21071-DA CSRs are mapped in the DECchip 21071-DA CSR space. The 
21071-DA chip responds to all accesses in this space. 
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10.1.1.5 PCI Interrupt Acknowledge/Special Cycle Space 
IBOOOOOOO.. IBFFF FFFF 

A read access to this address space causes an interrupt acl<nowledge cycle on 
the PCI . The byte enable generation mechanism is based on address<6:3> and 
is the same as that of the PCI sparse I/O space. See Table 10-3. The address 
is a don't care during this transaction. 

A write access to this space causes a special cycle on the PCI . The address and 
byte enables are don't care during this transaction. 

Note 



Software must use an STL instruction to initiate these transactions. 
An STQ instruction will result in a two longword burst on the PCI, 
which is illegal. 



10.1.1.6 PCI Configuration Space 

lEOOOOOOO.. IFFF FFFF 

A read or write access to this space causes a configuration read or write 
cycle on the PCI . There are two classes of targets— devices on the primary 
PCI bus and devices on secondary PCI buses that are accessed through PCI- 
to-PCI bridge chips. During PCI configuration cycles, the meanings of the 
address fields vary depending on the intended target of the configuration cycle. 
AD<1:0>, which are supplied by the HAXR2 register, indicate the target bus: 

AD<1:0> equal to 00 indicates the primary PCI bus. 
AD<1:0> equal to 01 indicates a secondary PCI bus. 

Table 10-4 defines the various fields of AD during the address phase of a 
configuration read or write cycle. 
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Table 10-4 PCI Configuration Space Definition 



Target Bus 



AD Bits Definition 



Primary PCI Bus 



<31:11> 



<10:8> 

<7:2> 

<1:0> 



Decoded from sysAdr<20:16> according to 
Table 10-5. 

Can be used for IDSEL#or don't cares. 
Typically, the I DSEL#pin of each device is 
connected to a unique AD line. 

Function select (1 of 8), from sysAdr<15:13>. 

Register select, from sysAdr<12:7>. 

00, fromHAXR2<l:0> 



Secondary PCI Buses 

(lUlust pass through a PCI-to-PCI bridge) 



<31:24> Forced to by the 21071-DA chip. 

<23:16> Secondary bus number, from sysAdr<28:21>. 

<15:11> Device number, from sysAdr<20:16>. 

<10:8> Function select (1 of 8), from sysAdr<15:13>. 

<7:2> Register select, from sysAdr<12:7>. 

<1:0> 01, fromHAXR2<l:0> 



10.1.1.6.1 PCI Configuration Cycles to Primary Bus Targets Primary PCI 
bus devices are selected during a PCI configuration cycle if their IDSEL#pin 
is asserted, the PCI bus command indicates a configuration read or write, 
and AD<l:0>are 00. AD<7:2>, which are taken from sysAdr<12:7>, select 
a longword register in the device's 256-byte configuration address space. 
Configuration accesses can use byte masks, which may be derived by following 
the method shown in Table 10-3. 

Peripherals that integrate multiple functional units (for example, SCSI and 
Ethernet) can provide configuration spaces for each function. AD<10:8>, which 
are taken from sysAdr<15:13>, can be decoded by the peripheral to select one 
of eight functional units. 

AD<31:ll>are used to generate the IDSEL#signals. Typically, the IDSEL# 
pin of each PCI peripheral is connected to a unique address line. AD<31:11> 
are decoded from sysAdr<20:16> according to Table 10-5, ensuring that only 
one bit of AD<31:11> is asserted for any given configuration space transaction 
on the primary PCI bus. sysAdr<28:21>are ignored. 
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Table 10-5 PCI Address Decoding for Primary Bus Configuration Accesses 

Device Number 

(sysAdr<20:16>) PCIAD<31:11> 

00000 0000 0000 0000 0000 0000 1 

00001 0000 0000 0000 0000 0001 

00010 0000 0000 0000 0000 0010 

00011 0000 0000 0000 0000 0100 

00100 0000 0000 0000 0000 1000 

00101 0000 0000 0000 0001 0000 

00110 0000 0000 0000 0010 0000 

00111 0000 0000 0000 0100 0000 

01000 0000 0000 0000 1000 0000 

01001 0000 0000 0001 0000 0000 

01010 0000 0000 0010 0000 0000 

01011 0000 0000 0100 0000 0000 

01100 0000 0000 1000 0000 0000 

01101 0000 0001 0000 0000 0000 
OHIO 0000 0010 0000 0000 0000 
01111 0000 0100 0000 0000 0000 

10000 0000 1000 0000 0000 0000 

10001 0001 0000 0000 0000 0000 

10010 0010 0000 0000 0000 0000 

10011 0100 0000 0000 0000 0000 

10100 1000 0000 0000 0000 0000 

10101 0000 0000 0000 0000 0000 

10110 0000 0000 0000 0000 0000 

10111 0000 0000 0000 0000 0000 

11000 0000 0000 0000 0000 0000 

11001 0000 0000 0000 0000 0000 

11010 0000 0000 0000 0000 0000 

11011 0000 0000 0000 0000 0000 

11100 0000 0000 0000 0000 0000 

11101 0000 0000 0000 0000 0000 

11110 0000 0000 0000 0000 0000 

11111 0000 0000 0000 0000 0000 
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10.1.1.6.2 PCI Configuration Cycles to Secondary Bus Targets If the PCI 

cycle is a configuration read or write cycle but AD<l:0>are 01, then a device 
on a secondary PCI bus is being selected across a PCI-to-PCI bridge. This 
cycle will be accepted by a PCI-to-PCI bridge for propagation to its secondary 
PCI bus. During this cycle, AD<23:16>, taken from sysAdr<28:21>, select a 
unique bus number; AD<15:11>, taken from sysAdr<20:16>, select a device on 
that bus (typically decoded by the target bridge to generate I DSEL#signals); 
AD<10:8>, taken from sysAdr<15:13>, select one of eight functional units 
per device; and AD<7:2>, taken from sysAdr<12:7>, select a longword in the 
device's configuration register space. 

Each PCI-to-PCI bridge device can be configured using PCI configuration cycles 
on its primary PCI interface. Configuration parameters in the PCI-to-PCI 
bridge will identify the bus number for its secondary PCI interface and a range 
of bus numbers that may exist hierarchically behind it. 

If the bus number of the configuration cycle matches the bus number of the 
bridge chip's secondary PCI interface, then it will intercept the configuration 
cycle, decode it, and generate a PCI configuration cycle with AD<1:0> equal to 

00 on its secondary PCI interface. If the bus number is within the range of bus 
numbers that may exist hierarchically behind its secondary PCI interface, the 
PCI configuration cycle passes, unmodified (leaving AD<1:0> = 01), through 
the bridge. The configuration cycle will be intercepted and decoded by a 
downstream bridge. 

10.1.2 PCI To Physical Memory Addressing 

1 ncoming 32-bit PCI memory addresses have to be mapped to the 34-bit 
physical memory addresses. The 21071-DA chip allows two regions in PCI 
memory space to be mapped to system memory with two programmable 
address windows. The mapping from the PCI address to the physical address 
can be direct (physical mapping with an extension register) or scatter/gather 
mapped (virtual). These two address windows are referred to as the PCI target 
windows. Each window has three registers associated with it. These are: 

• PCI base register 

• PCI mask register 

• Translated base register 

The PCI mask register provides a mask corresponding to bits <31:20>of an 
incoming PCI address. The size of each window can be programmed to be from 
1 MB to 4 GB, in powers of two, by masking bits of the incoming PCI address 
using the PCI mask register. Table 10-6 shows an example of this. 
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Table 10-6 PCI Target Window Enables 



0000 0000 0000 


1 MB 


0000 0000 0001 


2 MB 


0000 0000 0011 


4 MB 


0000 0000 0111 


8 MB 


0000 0000 1111 


16 MB 


0000 0001 1111 


32 MB 


0000 0011 1111 


64 MB 


0000 0111 1111 


128 MB 


0000 1111 1111 


256 MB 


0001 1111 1111 


512 MB 


0011 1111 1111 


1GB 


0111 1111 1111 


2 GB 


1111 1111 1111 


4GB3 



pci_Mask<31 :20>^ Size of Window Value of n^ 

20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 

^Combinations of bits in pd_Masl<<31:20>tliat are not sliown in Table 10-6 are not supported. 

^Depending upon tine target window size, only the incoming address bits <31:n>are compared 
with bits <31:n>of the PCI base registers, as shown in Figure 10-3 (n =20 to 32). If n=32, no 
comparison is performed, n is also used in Figure 10-5. 

^When this combination is chosen, the wEnb bit in the other PCI base register has to be cleared, 
otherwise the two windows will overlap. 



Based on the value of the PCI mask register, the unmasked bits of the incoming 
PCI address are compared with the corresponding bits of each window's PCI 
base register. If the base registers and the incoming PCI address match, 
then the incoming PCI address has hit in that PCI target window; otherwise, 
the incoming PCI address has missed in that window. A window enable bit, 
wEnb, is provided in each window's PCI base register to allow windows to be 
independently enabled or disabled. If a window's wEnb bit is set, then the 
window is enabled. The PCI target windows must be programmed so that the 
PCI address ranges that each one responds to do not overlap. The compare 
scheme between the incoming PCI address and the PCI base register (along 
with the PCI mask register) described previously is shown in Figure 10-3. 

Note 



The window base addresses should be on naturally aligned address 
boundaries depending on the size of the window. 
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Figure 10-3 PCI Target Window Compare 
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When an address match occurs with a PCI target window, the 21071-DA chip 
translates the 32-bit PCI address to a 34-bit processor byte address (actually 
a 29-bit hexaword address). The translated address is generated in one of two 
ways as determined by the scatter/gather bit of the window's PCI base register. 

If the scatter/gather bit is cleared, the DMA address is direct mapped, and 
the translated address is generated by concatenating bits from the matching 
window's translated base register with bits from the incoming PCI address. 
The PCI mask register determines which bits of the translated base register 
and PCI address are used to generate the translated address, as shown in 
Table 10-7. 
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Note 

The unused bits of the translated base register indicated in Table 10-7 
must be cleared for proper operation. Because system memory is 
located in the lower half of the CPU address space, address <33> is 
always 0. Address <32:5> is obtained from the translated base register. 



Table 10-7 PCI Target Address Translation— Direct Mapped (Scatter/Gather 
Mapping Disabled) 



pci_Mask<31 :20> 



Translated Address<32:5> 



0000 0000 0000 
0000 0000 0001 
0000 0000 0011 
0000 0000 0111 
0000 0000 1111 
0000 0001 1111 
0000 0011 1111 
0000 0111 1111 

0000 1111 1111 

0001 1111 1111 
0011 1111 1111 
0111 1111 1111 

1111 1111 1111 



t_Base<32:20> :pci_Address<19:5> 
t_Base<32:21> :pci_Address<20:5> 
t_Base<32:22> :pci_Address<21:5> 
t_Base<32:23> :pci_Address<22:5> 
t_Base<32:24> :pci_Address<23:5> 
t_Base<32:25> :pci_Address<24:5> 
t_Base<32:26> :pci_Address<25:5> 
t_Base<32:27> :pci_Address<26:5> 
t_Base<32:28> :pci_Address<27:5> 
t_Base<32:29> :pci_Address<28:5> 
t_Base<32:30> :pci_Address<29:5> 
t_Base<32:31> :pci_Address<30:5> 
t_Base<32>: pci_Address<31:5> 



If the scatter/gather bit is set, then the translated address is generated by 
a table lookup. The incoming PCI address is used to index a table stored 
in system memory. This table is referred to as a scatter/gather map. The 
translated base register specifies the starting address of the scatter/gather map 
table. Bits of the incoming PCI address are used as an offset from the base of 
the table. The map entry provides the physical address of the page. 
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Each scatter/gather map entry maps an 8 KB page of PCI address space 
into an 8 KB page of the processor's address space. Each scatter/gather map 
entry is a quadword. Each entry has a valid bit in bit position 0. Address 
bit <L3> is at bit position 1 of tine map entry. Because the DECchip 21071 
and DECchip 21072 chipsets implement only valid memory addresses up to 
6 GB, bits <53:21>of the scatter/gather map entry should be programmed to 
0. Bits <20:1> of the scatter/gather entry are used to generate the physical 
page address. This is appended to bits <12:5>of the incoming PCI address to 
generate the memory address that needs to go out on the sysBus. Figure 10-4 
shows the scatter/gather map entry. 

The size of the scatter/gather map table is determined by the size of the PCI 
target window as defined by the PCI mask register (Table 10-8). Because the 
scatter/gather map is located in system memory, translated address <33>is 
always 0. Address<32:3>are obtained from translated base register and the 
PCI address, as shown in Table 10-8. 



Table 10-8 Scatter/Gather Map Address 



pci_Mask<31 :20> 



Scatter/Gather 
Map Table Size 



Scatter/Gather Map Address<32:3> 



0000 


0000 


0000 


1KB 


t_Base<32:10>:pci 


_Address<19:13> 


0000 


0000 


0001 


2 KB 


t_Base<32:ll>:pci. 


_Address^0:13> 


0000 


0000 


0011 


4KB 


t_Base<32:12>:pci 


_Address^l:13> 


0000 


0000 


0111 


8KB 


t_Base<32:13>:pci 


_Address<22:13> 


0000 


0000 


1111 


16KB 


t_Base<32:14>:pci 


_Address<23:13> 


0000 


0001 


1111 


32KB 


t_Base<32:15>:pci 


_Address<24:13> 


0000 


0011 


1111 


64KB 


t_Base<32:16>:pci 


_Address<25:13> 


0000 


0111 


1111 


128KB 


t_Base<32:17>:pci 


_Address<26:13> 


0000 


1111 


1111 


256KB 


t_Base<32:18>:pci 


_Address^7:13> 


0001 


1111 


1111 


512 KB 


t_Base<32:19>:pci 


_Address<28:13> 


0011 


1111 


1111 


1MB 


t_Base<32:20> :pci 


_Address<29:13> 


0111 


1111 


1111 


2IV1B 


t_Base<32:21> :pci 


_Address<30:13> 


1111 


1111 


1111 


4MB 


t_Base<32:22> :pci 


_Address<31:13> 
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Figure 10-4 Scatter/Gather Map Page Table Entry in Memory 
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Figure 10-5 shows the entire translation from PCI address to physical address 
on a window that implements scatter/gather mapping. 

The process is as follows: 

• Bits <12:2>of the PCI address are used directly to generate the page offset. 

• The relevant bits of the PCI address (as specified by the window mask 
register, depending on the size of the window) are used to generate the 
offset into the scatter/gather map. 

• The relevant bits of the translated base register indicate the base address 
of the scatter/gather map. 

• The map base is appended to the map offset to generate the address of the 
corresponding scatter/gather entry. 

• Bits <20:1> of the map entry are used to generate the physical page 
address, which is appended to the page offset to generate the PCI address. 

• Bit 0>is the valid bit for the Page Table Entry. 
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Figure 10-5 Scatter/Gather Map Translation of PCI to sysBus Address 



PCI Address 



31 n 



13 I 12 05 I 04 00 



Peripheral Page Number 


Offset 





sysBus 
Base Address 
(Translated 
Base Register) 



Scatter/Gather 
Map Address 
Driven on 
sysBus 



Scatter/Gather Map 
{in Main Memory) 



Physical Memory 
Location Address 
Driven on sysBus 



Compare 



n-10 I n-11 07 






t_Base 


0000 











sysBus Page Number ::: 


Offset 



10-20 DECchip 21 071 -DA Programmer's Reference 



10.2 DECchip 21071-DA Internal Registers 

This section provides a summary of tine DECcliip 21071-DA internal registers, 
and it describes each register. 

10.2.1 Register Overview 

The control and status register (CSR) addresses are listed in Table 10-9. All 
registers are longword and are addressed on cache line boundaries (address 
<4:2> must be 0). Writes to read-only registers do not cause errors and are 
acknowledged as normal. Only O's should be written to unspecified bits within 
a register. Registers are initialized as specified in the detailed descriptions in 
this chapter. Addresses in CSR space that are not specified here should not be 
read or written. 



Table 10-9 DECchip 21071-DA Register Summary 



Address (hex) 



Register Name 



1 AOOO 0000 
1 AOOO 0020 
1 AOOO 0040 
1 AOOO 0060 
1 AOOO 0080 
1 AOOO OOAO 
1 AOOO OOCO 
1 AOOO OOEO 
1 AOOO 0100 
1 AOOO 0120 
1 AOOO 0140 
1 AOOO 0160 
1 AOOO 0180 
1 AOOO OlAO 
1 AOOO OICO 
1 AOOO OlEO 
1 AOOO 0200 
1 AOOO 0220 



Diagnostic control and status register (DCSR) 

PCI error address register (PEAR) 

sysBus error address register (SEAR) 

Dummy register 1 

Dummy register2 

Dummy registers 

Translated base 1 register 

Translated base 2 register 

PCI base 1 register 

PCI base 2 register 

PCI mask 1 register 

PCI mask 2 register 

Host address extension register (HAXRO) 

Host address extension register 1 (HAXRl) 

Host address extension register 2 (HAXR2) 

PCI master latency timer (PMLT) 

TLB tag register 

TLB tag 1 register 

(continued on next page) 
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Table 10-9 (Cont.) DECchip 21071-DA Register Summary 



Address (hex) 



Register Name 



1 AOOO 0240 
1 AOOO 0260 
1 AOOO 0280 
1 AOOO 02A0 
1 AOOO 02C0 
1 AOOO 02E0 
1 AOOO 0300 
1 AOOO 0320 
1 AOOO 0340 
1 AOOO 0360 
1 AOOO 0380 
1 AOOO 03A0 
1 AOOO 03C0 
1 AOOO 03E0 
1 AOOO 0400 



TLB tag 2 register 

TLB tag 3 register 

TLB tag 4 register 

TLB tag 5 register 

TLB tag 6 register 

TLB tag 7 register 

TLB data register 

TLB data 1 register 

TLB data 2 register 

TLB data 3 register 

TLB data 4 register 

TLB data 5 register 

TLB data 6 register 

TLB data 7 register 

Translation buffer invalidate all register (TBI A) 



10.2.2 Register Descriptions 

Tinis section provides registers descriptions. 

10.2.2.1 Dummy Registers 1-3 

Dumnny registers are CSRs that have no side effects on writes, and return O's 
on reads. Writes to these registers can be used to pack the DECchip 21064 
write buffers to prevent merging of sparse space I/O writes. Software does not 
have to use memory barrier instructions between writes if this mechanism is 
used. 

10.2.2.2 Diagnostic Control and Status Register (DCSR) 

The DCSR provides control of operational and diagnostic modes, and reports 
status and error conditions. Figure 10-6 shows the register bit assignments, 
and Table 10-10 provides the bit descriptions for the diagnostic control and 
status register. 
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Figure 10-6 Diagnostic Control and Status Register (DCSR) 
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pCmd 
Reserved 
pass2 
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Table 10-10 Diagnostic Control and Status Register 



Field 



Type, 
Extent Reset 



Description 



pass2 

Reserved 
pCmd 



<31> 



RO,- 



<30:22> MBZ 
<21:18> RO,- 



Chip version reads low on passl and reads 
high on pass2. 



The pCmd field indicates the PCI cycle type 
when a PCI -initiated error is logged in the 
DCSR. This field is only valid when iPTL, 
nDev, tAbt, or ioPE errors are set. 



(continued on next page) 
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Table 10-10 (Cont.) Diagnostic Control and Status Register 



Field 



Type, 
Extent Reset 



Description 



dByp<l:0> <17:15> RW, 



The disable read bypass bits are used to 
control the ordering of PCI -initiated memory 
reads with respect to PCI -initiated memory 
writes. This field has three modes: 



Value 


Mode 


Description 


00 


Full 


In this mode, PCI- 




Bypass 


initiated memory reads 
will bypass buffered DMA 
writes if the double- 
hexaword address of the 
read does not match that 
of the buffered writes. 
The address comparison 
is done across address 
bits <31:6>. 


01 


Reserved 


— 


10 


Partial 


In this mode, DMA reads 




Bypass 


will bypass buffered 



memory writes if the 
address within the page 
does not match that 
of the buffered DMA 
writes. The address 
comparison is done across 
bits <12:6>. 
11 No In this mode, DMA read 

Bypass bypassing is completely 
disabled. DMA reads 
will be ordered with 
respect to DMA writes 
originating on the PCI . 

(continued on next page) 
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Table 10-10 (Cont.) Diagnostic Control and Status Register 



Field 



Type, 
Extent Reset 



Description 



mErr 



iPTL 



uMRD 



<15> RWC.O The memory error (mErr) bit is set when 

the 21071-DA chip receives an error code 
in the ioCAcl«l:0> field in response to 
a memory access. sysAdr<33:5> for this 
transaction is logged in the sysBus error 
address register <31:4>. This bit is not logged 
if the sysBus error address register is locked 
by a previous error. The lost error bit is set 
instead. 

If the mErr bit and either the cMRD or the 
uM RD bits are set, this indicates that the 
address for the mErr is lost. 

<14> RWC.O The invalid page table lookup (iPTL) bit is set 

when the longword scatter/gather map entry 
being accessed is invalid. (See Figure 10-4.) 
AD<31:0>is logged in the PCI error address 
register, if it is not already locked. 

If the IPTL bit and any of the ioRT, ioPE, 
nDev tAbt, and dDPE bits are set, this 
indicates that the address for the IPTL is 
lost. 

<13> RWC,0 The uncorrectable memory read data (uM RD) 

bit is set when an uncorrectable error is 
encountered by the 21071-DA chip. The error 
is encountered when the data read from 
the DMA read buffer in the 21071-BA chip 
reaches the 21071-DA chip on a DMA read or 
a scatter/gather read transaction. 

sysAdr<33:6>for this transaction is logged in 
the sysBus error address register <31:4>, if the 
SEAR is not locked. 

(continued on next page) 



DECchip 21071-DA Programmer's Reference 10-25 



Table 10-10 (Cont.) Diagnostic Control and Status Register 



Field 



Type, 
Extent Reset 



Description 



CMRD 



<12> RWCO 



nDev 



<11> 



RWCO 



tAbt 



ioPE 



dDPE 



Reserved 



<10> 



<9> 



<8> 



<7> 



RWCO 



RWCO 



RWCO 



MBZ 



The correctable memory read data (cMRD) 
bit is set when a correctable error Is 
encountered by the 21071-DA chip. The error 
is encountered when the data read from the 
DMA read buffer in the 21071-BA reaches the 
21071-DA on a DMA read or a scatter/gather 
read transaction. 

sysAdr<33:6>for this transaction is logged 
in the sysBus error address register <31:4> if 
the SEAR is not locked. The logging of this 
error can be prevented by setting the disable 
correctable error (dCEl) in this register. 

The no device (nDev) bit is set when 
DEVSEL#is not asserted in response to 
an I/O read or write transaction initiated on 
the PCI by the 21071-DA chip. AD<31:0>for 
this transaction is logged in the PCI error 
address register <31:0>. 

The target abort (tAbt) bit is set when a PCI 
slave device ends an I/O read or write trans- 
action using the PCI target abort protocol. 
AD<31:0>for this transaction is logged in the 
PCI error address register <31:0>. 

The I/O parity error (ioPE) bit is set when a 
parity error occurs in the data phase of an 
I/O read or I/O write transaction. AD<31:0> 
for this transaction is logged in the PCI error 
address register <31:0>. 

The DMA data parity error (dDPE) bit is set 
when a parity error occurs in the data phase 
of a DMA transaction. AD<31:0>for this 
transaction is logged in the PCI error address 
register <31:0>. 



(continued on next page) 



10-26 DECchip 21071-DA Programmer's Reference 



Table 10-10 (Cont.) Diagnostic Control and Status Register 



Field 



Type, 
Extent Reset 



Description 



lost 



<6> 



RWCO 



ioRT 



<5> 



RWC, 



dPEC 



dCEl 



<4> 



<3> 



RW,0 



RW,0 



pEnb 



<2> 



RW,0 



Reserved 



<1> 



MBZ 



The lost error (lost) bit is set by the occurrence 
of an 21071-DA chip error condition when 
the address register corresponding to that 
error is locked because of a previous error. 
Under those circumstances, error information 
pertaining to the second error is lost. The 
logged address information in the sysBus 
error address register or the PCI error address 
register still remains valid for the initial error 
condition indicated by the error bit already 
set. 

This bit is set when a retry timeout error 
occurs on CPU-initiated write or read 
transactions on the PCI . AD<31:0> is logged 
in the PCI error address register. 

This bit is also set in the event that the 
21071-DA chip sees GntL deassert during the 
address portion of a configuration transaction 
2^"* consecutive times. 

When the disable parity error checking 
(dPEC) bit is set, parity checking will not 
be performed on the PCI bus (address and 
data cycles, DMA and 1/0 transactions). 
Parity generation is not affected in any way. 

When the disable correctable error interrupt 
(dCEl) bit is set, correctable errors on DMA 
read data are not logged in the cM RD bit 
(DCSR12), and the address is not updated in 
the sysBus error address register. This bit 
only determines whether the error is logged 
and if the processor is interrupted. 

If the prefetch enable (pEnb) bit is set, 
the sysBus master machine will enable 
prefetching on DMA reads. This bit should be 
self cleared following system reset and should 
not be changed while DMA operations are 
going on. 



(continued on next page) 
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Table 10-10 (Cont.) Diagnostic Control and Status Register 

Type, 
Field Extent Reset Description 

tEnb <0> RW,0 When the TLB enable (tEnb) bit is set, the 

entire translation buffer (TLB) is enabled. 
When this bit is cleared, the TLB will be 
turned off and subsequent scatter/gather 
reads will not result in allocation of TLB 
entries. Entries that were valid when the 
tEnb bit was cleared will remain valid. To 
invalidate valid entries, software must write 
to the TBI A register. 
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10.2.2.3 PCI Error Address Register 

Figure 10-7 shows the register bit assignments, and Table 10-11 provides the 
bit descriptions for the PCI error address register. 

Figure 10-7 PCI Error Address Register 
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00 


pcLErr <31:0> 



1 AOOO 0020 
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Table 10-11 PCI Error Address Register 



Type, 
Field Extent Reset Description 



pci_Err<31:0> <31:0> RO,— The address sent out on the PCI 

bus (AD<31:0>) as a result of an I/O 



transaction is stored here. This field logs 
the address of the errors indicated by the 
nDev, tAbt, ioPE, dDPE, iPTL, and ioRT 
bits in the DCSR. This register is valid 
only when one of these six error bits is set. 
If one of these six error bits is set, then a 
subsequent nDev, tAbt, ioPE, dDPE, iPTL, 
or ioRT error will not update the address 
logged in this register, and the lost bit in 
DCSR is set. 

pci_Err<31:0> are valid for nDev and 
iPTL. Only pci_Err<31:5> are valid for 
ioRT, tAbt, and ioPE errors that occur 
during dense memory writes. For ioRT, 
tAbt, and ioPE errors on any other 
transaction, pci_Err<31:3> are valid. 
pci_Err<31:6> are valid for dDPE errors. 
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10.2.2.4 sysBus Error Address Register 

Figure 10-8 shows the register bit assignments, and Table 10-12 provides the 
bit descriptions for the sysBus error address register. 



Figure 10-8 sysBus Error Address Register 
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03 02 01 00 



sys Err <33:6> 



1 AOOO 0040 



Reserved 
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Table 10-12 sysBus Error Address Register 



Field 



Type, 
Extent Reset 



Description 



sys_Err<33:6> <31:4> RO,- 



Reserved 



<3:0> 



MBZ 



The address sent exit on the sysBus. 
(sysAdr<33:6> as a result of a DMA 
transaction is stored here.) This field 
logs the address of the errors indicated 
by themErr, uMRD, or cMRD bits in the 
DCSR. This register is valid only when one 
of these three error bits is set. If one of 
these three error bits is set, a subsequent 
mErr, uMRD, or cMRD error will not 
update the address logged in this register, 
and the lost bit in DCSR is set. 
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10.2.2.5 Translated Base Registers 1-2 

Figure 10-9 shows the register bit assignments, and Table 10-13 provides the 
bit descriptions for the translated base registers 1 through 2. 



Figure 10-9 Translated Base Registers 1-2 



09 08 07 06 05 04 03 02 01 00 



t_Base <32:10> 






























v_ 



Reserved 

LJ-03O87-TI0 



Table 10-13 Translated Base Registers 1-2 



Field 



Extent 



Type, 
Reset 



Description 



t Base<32:10> 



<31:9> 



RW,- 



Reserved 



<8:0> 



MBZ 



If scatter/gather mapping is disabled 
t_Base specifies the base CPU address of 
the translated PCI address for the PCI 
target window. I f scatter/gather mapping 
is enabled t_Base specifies the base CPU 
address for the scatter/gather map table 
for the PCI target window. 



DECchip 21 071 -DA Programmer's Reference 10-31 



10.2.2.6 PCI Base Registers 1-2 

Figure 10-10 shows the register bit assignments, and Table 10-14 provides the 
bit descriptions for the PCI base registers 1 through 2. 

Figure 10-10 PCI Base Registers 1-2 
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Table 10-14 PCI Base Registers 1-2 



Field 



Extent 



Type, 
Reset 



Description 



pci_Base<31:20> <31:20> RW — 
wEnb <19> RW,0 



sGEn 



<18> 



RW,0 



Resa"ved 



<17:0> 



MBZ 



pci_Base specifies the base address of the 
PCI target window. 

When the window enable (wEnb) bit is 
cleared, this PCI target window is disabled 
and will not respond to PCI -initiated 
transfers. When wEnb is set, this PCI 
target window is enabled and will respond 
to PCI -initiated transfers that hit in the 
address range of the target window. This 
bit should be disabled by the processor 
(software) when modifying any of the PCI 
target window registers (base, mask, or 
translated base). 

When the scatter/gather enable (sGEn) 
bit is cleared, the PCI target window uses 
direct mapping to translate a PCI address 
to a CPU address. When this bit is set, 
the PCI target window uses scatter/gather 
mapping to translate a PCI address to a 
CPU address. 
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10.2.2.7 PCI Mask Registers 1-2 

Figure 10-11 shows the register bit assignments, and Table 10-15 provides the 
bit descriptions for the PCI masl< registers 1 through 2. 

Figure 10-11 PCI Mask Registers 1-2 
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Table 10-15 PCI Mask Registers 1-2 



Field 



Type, 
Extent Reset 



Description 



pci_Masl«31:20> <31:20> RW,- 



pci_Mask specifies the size of the PCI 
target window. It is also used in the 
translation of the PCI address to the 
CPU address. 



Reserved 



<19:0> MBZ 
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10.2.2.8 Host Address Extension Register (HAXRO) 

This register is liardcoded to 0. A read from tiiis register returns a 0; a write 
does notlning. 

10.2.2.9 Host Address Extension Register 1 (HAXR1) 

Tin is register is used to generate AD <31:27> on CPU -initiated transactions tinat 
address PCI memory space. Figure 10-12 sliows tlie register bit assignments, 
and Table 10-16 provides the bit descriptions for host address extension 
register 1. 

Figure 10-12 Host Address Extension Register 1 (HAXR1) 
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Reserved 
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Table 10-16 Host Address Extension Register 1 



Field 



Extent 



Type, 
Reset 



Description 



eAddr<4:0> 



Resa"ved 



<31:27> RW,0 



For CPU-initiated transactions to PCI 
memory, eAddr<4:0>are used as the upper 
five PCI address bits (AD<31:27>). 



<26:0> 



MBZ 
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10.2.2.10 Host Address Extension Register 2 (HAXR2) 

This register is used to generate AD <31:24> on CPU-initiated transactions 
tinat address PCI I/O space. It is also used to generate AD<1:0> during 
PCI configuration reads and writes. Figure 10-13 sinows tine register bit 
assignments, and Table 10-17 provides the bit descriptions for host address 
extension register 2. 

Figure 10-13 Host Address Extension Register 2 (HAXR2) 
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Table 10-17 Host Address Extension Register 2 



Field 


Extent 


Type, 
Reset 


eAddr<7:0> 


<31:24> 


RW,0 


Reserved 


<23:2> 


MBZ 


conf Addr<l:0> 


<1:0> 


RW,0 



Description 



For CPU-initiated transactions to PCI I/O 
space, eAddr<7:0> are used as the upper 
eight PCI address bits (AD<31:24>). 



For CPU-initiated transactions to PCI 
configuration space, conf_Addr<l:0> are 
used as the lower two PCI address bits 
(AD<1:0>). 
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10.2.2.11 PCI Master Latency Timer Register 

Figure 10-14 shows the register bit assignments, and Table 10-18 provides the 
bit descriptions for the PCI master latency timer register.. 

Figure 10-14 PCI Master Latency Timer Register 
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Table 10-18 PCI Master Latency Timer Register 



Field 



Type, 
Extent Reset 



Description 



Resa"ved 
p|VlLC<7:0> 



<31:8> MBZ 
<7:0> RW,0 



pMLC<7:0>is loaded into the master 
latency timer register at the start of a 
PCI master transaction initiated by the 
21071-DA chip.i 



■"^This value should be programmed to be non-zero during system configuration. 
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10.2.2.12 TLB Tag Registers 0-7 

These registers are read only. TLB tag registers through 7 have identical 
formats. 

Figure 10-15 shows the register bit assignments, and Table 10-19 provides the 
bit descriptions for TLB tag registers through 7. 

Figure 10-15 TLB Tag Registers 0-7 
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Table 10-19 TLB Tag Registers 0-7 



Field 



Extent 



Type, 
Reset 



Description 



pci_Page<31:13> <31:13> RO,- 



eVal 



Reserved 



<12> RO,0 



<11:0> MBZ 



The pci_Page bit field specifies the PCI 
page address (TAG) corresponding to 
the translated CPU page address in the 
associated TLB data register. 

The entry valid (eVal) bit can be read 
through this bit position. Normally, the 
invalid bit contains the value read during 
a page table entry read. 
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10.2.2.13 TLB Data Registers 0-7 

TLB data registers through 7 have identical formats. These registers are 
read only. Figure 10-16 shows the register bit assignments, and Table 10-20 
provides the bit descriptions for TLB data registers through 7. 

Figure 10-16 TLB Data Registers 0-7 
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Table 10-20 TLB Data Registers 0-7 



Field 



Extent 



Type, 
Reset 



Description 



Reserved <31:21> MBZ 

cpu_Page<32:13> <20:1> RO,— 



Bits <32:13> of the translated CPU 
address can be read through this bit 
field. 



Reserved 



<0> 



MBZ 



10.2.2.14 Translation Buffer Invalidate All (TBIA) 

This register is a write-only register. A write to this register causes all the 
valid entries in the scatter/gather map TLB to be invalidated. 
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11 

DECchip 21 071 -DA Transactions 



This chapter describes the transaction flows for the 21071-DA chip, from the 
sysBus to the PCI and vice versa. 

Throughout this chapter, the terms transaction and command are used 
interchangeably. In general, higher level transactions are composed of lower 
level transactions, or bus commands. For example, a DMA write transaction 
consists of a PCI memory write transaction (command) and a sysBus DMA 
write transaction (command). 

11.1 CPU-Initiated Transactions 

The 21071-DA chip responds to CPU-initiated transactions that address PCI 
space or the 21071-DA CSR space. I n addition to this, it also responds to 
barrier, fetch, and fetchM. fetch and fetchM transactions are acknowledged 
immediately by sending cpuCAck OK on ioCmd<2:0>; no further action is 
taken. 

11.1.1 Remote (PCI) Space I/O Read 

This section describes the 21071-DA chip response to CPU-initiated remote 
(PCI) space I/O read transactions. 

• The sysBus interface continuously monitors the command and address 
on the sysBus. When it detects a read block command and the address 
is in PCI space, it generates the PCI address, byte enables, and the PCI 
command (PCI memory read, PCI I/O read, PCI configuration read, and 
PCI interrupt acknowledge), and it notifies the PCI master state machine. 

• The PCI master state machine asserts the ReqL pin on the PCI and waits 
for the bus to be granted to it. If the bus is parked with the 21071-DA 
chip, that is, GntL is already asserted, then the 21071-DA chip does not 
assert ReqL. 
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If a read from system memory happens on the PCI before the PCI master 
machine has gained ownership of the PCI, the I/O read on the sysBus is 
preempted by the 21071-DA chip. This is done to prevent deadlocl<s from 
occurring. 

When a grant is received on the PCI, the address and command are sent 
out on the PCI, and a request is sent to the epiBus arbiter to set the 
direction of the epiBus from the 21071-DA chip to the 21071-BA chips. The 
epiBus arbiter is described in Section 11.3. 

The PCI master state machine waits for a response from the PCI target 
device. 

When the target responds, the transaction can complete in different ways: 

- If the target successfully returns data, the PCI master terminates the 
transaction and releases the PCI . 

- If the target aborts the transaction with a error, or if a parity error 
is found by the 21071-DA chip on the read data, the PCI master 
terminates the transaction and releases the PCI. When the data is 
returned to the CPU, an error is flagged. 

- If the target disconnects the transaction, indicating that the master 
should retry the transaction at a later time, the PCI master machine 
terminates the transaction, gives up the request for the epiBus, goes 
back to idle, and retries the transaction after the PCI bus becomes 
idle. 

If, when data is available from the PCI device, the epiBus has been granted 
to the PCI master machine, data is sent across the epiBus into the I/O read 
buffer and the request for the epiBus is cleared. If the epiBus has not been 
granted to the PCI master state machine, data cannot be sent to the I/O 
read buffer in the 21071-BA chip and is temporarily held in the 21071-DA 
chip. A subsequent PCI transaction addressed to tine 21071-DA chip may 
stall TrdyL until the I/O read data has been moved. 

The transaction completes when the read data has been returned to the 
CPU. 
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11.1.2 Remote (PCI) Space I/O Write 

This section describes the 21071-DA chip response to CPU-initiated remote 
(PCI ) space I/O write transactions. 

• When an I/O write to PCI space is detected on the sysBus, the data is 
loaded into the I/O write buffer by the 21071-CA chip. The 21071-BA chip 
always has room to accommodate the data for an I/O write transaction. 
One transaction is queued to go out on the PCI , and the second transaction 
stalls on the sysBus until the first is completed. The data for the second 
write is loaded into the second entry of the I/O write buffer (which acts as 
a holding buffer). 

• The address is loaded into the I/O write address buffer, an I/O write 
request is posted to the PCI master state machine, and the transaction is 
acl<nowledged on the sysBus. A second I/O write will stall on the sysBus, 
until the first one completes on the PCI. 

• The PCI master state machine asserts the ReqL pin on the PCI and waits 
for the bus to be granted to it. At the same time, a request is sent to the 
epiBus arbiter to set the direction of the epiBus from the 21071-BA chips 
to the 21071-DA chip. 

• DMA transactions are serviced until the PCI master machine gets 
ownership of the PCI. If a second I/O write, CSR write, I/O read, or 
CSR read is stalled on the sysBus behind this write, it will be preempted 
to allow DMA read transactions or one DMA write transaction (if the DMA 
write buffer is full) to complete. 

• When the PCI master acquires ownership of the epiBus, two longwords of 
data are then loaded into PCI output latches (temporary holding latches 
only). If a DMA read happens before the I/O write has been able to get out 
on the PCI, the data in the holding latches is lost and the arbitration has 
to be performed again. 

• When the PCI master receives the grant on the PCI , it drives the address 
and command, and it asserts FrameL on the PCI if the I/O write data is 
ready to go out on the PCI in the following cycle (if the epiBus has been 
granted to the PCI master for transferring I/O write data). 

• The PCI master drives AD<31:0> and CBE_I<3:0>, and it asserts FrameL. 
This allows the target device to decode and acl<nowledge (by asserting 
DevselL) the address. Data should be ready to be driven on the PCI by 
that time. 
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• As write data is sent out on the PCI, subsequent longwords are unloaded 
from the I/O write buffer into the PCI output latches through the epiData 
bus, until the transaction is terminated. 

• It is possible that the target of the I/O transaction retries or disconnects 
the transaction before all the data has been transferred. The 21071-DA 
chip waits for the PCI to become idle and performs another I/O write 
transaction with the unwritten data. 

• If the entire data transfer completes, the transaction is terminated on the 
PCI. 

11.1.3 CSR Space I/O Read 

A read from the 21071-DA CSRs behaves similarly to the remote read, except 
that the transaction does not go out on the PCI . I nstead, the data is read from 
the 21071-DA CSRs. 

Because CSR reads complete with a fixed l<nown latency, a CSR read 
transaction is not preempted unless it is queued behind an I/O write to the 
PCI, which cannot complete because of DMA transactions on the PCI directed 
toward system memory. No errors are detected during this transaction. 

11.1.4 CSR Space I/O Write 

A write to the 21071-DA CSRs behaves similarly to the remote write, except 
that the transaction does not go out on the PCI . I nstead, the data is written to 
the 21071-DA CSRs. Data from the 21071-BA chip still has to be transferred 
to the 21071-DA chip. 

CSR writes, like I/O writes, are only preempted if they are queued behind an 
I/O write to the PCI, which cannot complete because of DMA transactions on 
the PCI. 

1 1 .1 .5 Memory Barrier 

The 21071-DA chip uses the memory barrier transaction as a means of 
synchronization between the DMA stream and the CPU stream. 

On a memory barrier transaction, the 21071-DA chip flushes the I/O write 
buffer and those entries of the DMA write buffer that were valid when the 
memory barrier transaction was recognized on thesysBus. The 21071-DA chip 
preempts the memory barrier in order to flush the DMA writes. A memory 
barrier is also preempted if a DMA read transaction is directed toward the 
21071-DA chip, or if the DMA write buffer becomes full while the 21071-DA 
chip is waiting to unload its I/O write buffer on the PCI. 
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11.2 PCI-lnitiated Transactions 

The 21071-DA chip supports PCI -initiated transactions directed towards 
system memory only. System memory can be mapped to two regions in PCI 
space. The PCI slave is always monitoring FrameL and the PCI address 
and command to determine if there is a transaction targeted towards system 
memory. All PCI memory write commands are treated the same, and all the 
PCI memory read commands are treated the same, except for read multiple 
commands (causes start of prefetch sequence). 

11.2.1 PCI Memory Read, Read Line, and Read Multiple 

This section describes the 21071-DA chip response to PCI memory read, read 
line, and read multiple transactions. 

• Whenever the slave machine sees FrameL asserted, it checks for a valid 
PCI command and for a hit in one of its address windows. If the address or 
command is a hit, it asserts DevselL and proceeds with the transaction. If 
the address or command is a miss, it does not do anything. 

• The slave machine posts a DMA read request to the sysBus master 
machine. 

• A DMA request is posted to the sysBus arbiter. 

• The sysBus master compares the read address with addresses queued in 
the DMA write buffer. If there is a hit, writes are serviced until the write 
that matches the read has been retried on the sysBus. If bypass mode is 
turned off, the DMA read does not proceed until all buffered DMA writes 
are completed. 

Note 



The comparison is on untranslated PCI addresses, not on physical 
memory addresses. 



At the same time, the sysBus master does a lookup in the TLB in order to 
determine if a scatter/gather map read is necessary. A scatter/gather map 
read is performed if the PCI window being addressed had scatter/gather 
mapping enabled, and there was a miss in the TLB. 

The scatter/gather read is performed (described later) or the TLB is read, 
and a translated physical memory address is generated. 
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When grant is received on the sysBus, the sysBus master will perform a 
fetch. The 21071-DA will also perform a prefetch, if the prefetch enable bit 
is set in the DCSR and the transaction is addressed to the even cache line, 
or if the PCI command is a read multiple and the transaction is addressed 
to the even cache line. If prefetching will be performed, an atomic request 
is posted on ioRequest<l:0>. The sysBus master arbitrates for the epiBus 
so that the direction of the epiBus is from the 21071-BA chips to the 
21071-DA chip. 

The address and command are sent out on the sysBus. If the sysBus 
master is prefetching, two DMA read transactions are done one after the 
other on the sysBus (guaranteed by the atomic request). A DMA read burst 
command is used on the first read, and a DMA read command is used on 
the prefetched read. If the requested data is the second octaword in the 
cache line, the wrapped command is used; if prefetching is enabled, the 
first command used is DMA read burst wrapped, but the second command 
is always DMA read. 

The read data (either from the cache or memory) is loaded into the DMA 
read buffer of the 21071-BA chip by the 21071-CA chip. The control signals 
are used to access the appropriate cache line, and longwords within it are 
set up when the sysBus master receives ownership of the epiBus. As the 
data is loaded into the DMA read buffer, valid bits are set to indicate which 
longwords are ready to be returned on the PCI . 

The termination conditions of a PCI memory read transaction are as 
follows: 

- The initiator terminates the transaction. 

- Prefetching is not enabled and a cache line boundary crossing is 
attempted. 

- A burst attempts to cross an odd-even cache line boundary, even if 
prefetching is enabled. 

- An uncorrectable memory data error is detected on the requested data. 

- The sysBus transaction is acknowledged with an error by the 21071-CA 
chip. 

If the sysBus transaction completes before the PCI transaction (this will 
usually happen on a long burst), the sysBus is released. 
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• If the PCI transaction completes before the sysB us transaction (this will 
usually happen on a short burst when prefetching is enabled), data 
remaining in the DMA read buffer is discarded. If the transaction 
completes before the prefetch has started on the sysBus, the prefetch 
transaction will not be performed. 

11.2.2 PCI Memory Write/Write and Invalidate 

This section describes the 21071-DA chip response to PCI memory write/write 
and invalidate transactions. 

• The transaction begins just like a read. If the address is a hit, DevselL is 
asserted and the transaction continues. 

• The PCI slave machine requests arbitration of the epi Bus. The default path 
of the epIBus is in the direction of the DMA write. If the PCI slave machine 
has the epIBus, the write data is transferred to the 21071-BA chips. If the 
epIBus is busy doing a CSR read, a CSR write, or a scatter/gather read, the 
PCI slave will stall the first data transfer. 

• After the DMA write path is set up between the 21071-DA chip and 
the DMA write buffer through the epiBus, the 21071-DA chip does not 
stall data transfers. If the transfer is stalled by the PCI master, the 
corresponding epiBus transfer is also stalled. 

• The termination conditions of the PCI memory write transactions are as 
follows: 

- The initiator terminates the transaction. 

- The write burst attempts to cross an odd-even cache line boundary. 

- Only one DMA write buffer entry was available at the beginning of the 
transaction, and a cache line boundary crossing is attempted. 

• If the write buffer was full at the beginning of the transaction or if the 
21071-DA chip is locked by a different PCI bus master, the PCI slave 
disconnects without any data transfers. 

• When a cache line boundary is crossed, and there were no data parity 
errors, a valid bit is set, and the corresponding cache line entry is ready 
to go out on the sysBus. If a write data parity error is detected on any 
longword of that cache line, the valid bit is not set and data is not written 
to memory. The PCI burst continues normally. 

• The sysBus master is always monitoring the state of the DMA write buffer. 
When it sees a valid write, it performs the address translation (doing 

a scatter/gather read if necessary) and requests the sysBus using DMA 
request. 
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• The address of the transaction is sent out on thesysBus along with a DIM A 
write full or DMA write masked command, depending on whether the 
entire cache line has valid data. 

• The transaction completes when the 21071-CA chip returns an OK on 
ioCAcl<<L:0>tothe21071-DA chip. 

11.2.3 PCI Exclusive Access to System Memory 

This section describes the 21071-DA chip response to PCI exclusive access to 
system memory transactions. 

• The PCI slave machine monitors the LockL signal along with FrameL. It 
uses the value of LockL in the cycle of FrameL assertion and in the cycle 
following the assertion of FrameL to determine whether or not the access 
is locked. 

• If LockL is asserted during the cycle of FrameL, the PCI slave machine will 
not accept the transaction and will terminate it with a target disconnect 
(retry, no data transfers). 

• If LockL is deasserted during the cycle of FrameL and is deasserted in 
the following cycle, the transaction proceeds normally as described in the 
previous sections. 

• If LockL is deasserted during the cycle of FrameL, and is asserted in the 
following cycle, the transaction is treated as locked. 

• Locked transactions are not treated specially on the PCI by the PCI slave 
machine. They clear the system lock flag. The system locl< flag is held 
clear until the PCI LockL signal is released by the locking master. DMA 
read bypass is disabled as long as there are locked writes in the DMA write 
buffer. 

• The system lock flag is cleared by sending an ioClrLock encoding on 
ioCmd<2:0> instead of an Idle encoding when the 21071-DA chip does not 
own the sysBus. 

11.2.4 Scatter/Gather Map Read 

A scatter/gather read is similar to a PCI -initiated DMA read on the sysBus. 
Data has to be loaded from the DMA read buffer into the TLB. 

If errors (uncorrectable data errors, memory errors, and invalid scatter/gather 
entry errors) are found on a scatter/gather read, the transaction that caused 
the scatter/gather read is not performed. On PCI read transactions, the 
transaction is aborted by the 21071-DA chip; on writes an interrupt is posted. 
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11.3 epiBus Arbitration 

At any given time, the 21071-DA chip could be servicing multiple transactions 
(CPU-initiated and PCI-initiated), all of which have to use the epiBus. The 
21071-DA chip contains a central epiBus arbiter, which arbitrates for the 
bus and appropriately sets the direction of the epiBus. The PCI master and 
slave, as well as the sysBus master and slave, all request the bus for various 
transactions. Table 11-1 lists the priority of the various requests and the 
direction of the epiBus. 

Table 11-1 epiBus Arbitration Priority 

Priority Transaction Direction 

1 PCI! /Oreads 21071-DA to 21071-BA 

2 DMA writes (default) 21071-DA to 21071-BA 

3 DMA reads 21071-BA to 21071-DA 

4 PCI I/O writes 21071-BA to 21071-DA 

5 CSR writes 21071-BA to 21071-DA 

6 CSR reads 21071-DA to 21071-BA 

7 Scatter/gather reads 21071-BA to 21071-DA 
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This chapter includes the following information about the DECchip 21071-DA 
chip: 

• DC Electrical Data 

• AC Electrical Data 

12.1 DC Electrical Data 

This section describes the dc characteristics of the DECchip 21071-DA chip. 

12.1.1 Absolute Maximum Ratings 

Table 12-1 lists the maximum ratings of the DECchip 21071-DA chip. 
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Table 12-1 DECchip 21071-DA Maximum Ratings 



Characteristics IVIinimum Maximum 



Storage temperature 
Operating ambient temperature 

Air flow 

J unction temperature 



Supply voltage with respect to Vss, 
witii reset_l asserted 

Supply voltage witii respect to Vss, 
witii reset I deasserted 



Voltage on any pin witii respect to Vss 



Maximum power: 

(a/dd= 5.25 V 1.5 W 

(aCycle =30 ns 

^Ifpm = linear feet per minute 



-55°C (-67°F) 


125°C (257°F 


0°C (32°F) 


40°C (104°F 


100 Ifpm^ 


— 


25°C (77°F) 


85°C (185°F 


-0.5 V 


-f6.5 V 


4.75 V 


5.25 V 


-0.5 V 


Vdd +0.5V 
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Table 12-2 lists the dc parametric values of the DECchip 21071-DA chip. 



Table 12-2 DC Parametric Values 



Symbol 


Description 


Minimum 


Maximum 


Units 


Test Conditions 


^ih 


Input high voltage 


2.0 


- 


V 


- 


^il 


Input low voltage 


- 


0.8 


V 


- 


^oh 


Output high voltage 


2.4 


- 


V 


- 


^ol 


Output low voltage 


- 


0.4 


V 


- 


'il 


1 nput leakage current-^ 


-5 


5 


HA 


OV <Vin <Vdd 


'ilpu 


1 nput leakage current^ 


-15 


-100 


iiA 


OV <Vin <Vdd 


'ilpd 


1 nput leakage current^ 


15 


100 


[XA 


OV <Vin <Vdd 


'ol 


Output leakage current 
(tristated) 


-10 


10 


HA 


OV <Vin <Vdd 



^Excluding scanEn, testMode, and tristateL. 

^For tristateL. 

^For scanEn and testMode. 
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12.2 AC Electrical Data 

This section describes tine ac characteristics of the DECchip 21071-DA chip. 

12.2.1 Clocks 

The DECchip 21071-DA chip uses one clocl< (running at twice the nominal 
system frequency) plus a synchronous phase reference signal to generate four 
internal clock edges. An additional clock input is used to generate two internal 
clocks for the PCI logic. See Figures 12-1 and 12-2, and Tables 12-3 and 12-4 
for details about DECchip 21071-DA external clock requirements and internal 
clock phase relationships. 

A clock system must meet the requirements shown in Figure 12-1 and 
Table 12-4 to guarantee the proper behavior of the 21071-DA chip's internal 
logic. The 21071-DA chip does not specify the maximum skew allowed for 
external transfers to or from the CPU, Bcache PALs, Bcache, 21071-BA 
chips, or 21071-CA chip because these skew limits are dependent on module 
placement and routing. A system designer must examine external transfers to 
determine the maximum clock skews allowed between chips. 

The skew numbers shown in Figure 12-1 and Table 12-4 are given for 
a 30.0 ns cycle time. At a longer cycle time, the allowable skew may be 
increased, as long as the given minimum times between clock edges are not 
violated. These skew limits assume that the 21071-DA chip adds another 0.1 
ns of uncertainty between rising and falling edges due to non-ideal input buffer 
switching thresholds. 
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Table 12-3 DECchip 21071-DA Clock AC Characteristics 



Parameter 



Minimum 



lUlaximum 



Unit 



Note 



System cycle time 
elk 1x2 period 
clklx2 frequency 
clklx2 rise time 
clklx2 fall time 

pClk period 
pClk rise time 

clk2ref setup to clklx2 rising 
clk2ref hold from clklx2 rising 



30 


— 


ns 


cin Figure 12-1 


15 


— 


ns 


— 


— 


66 


MHz 


— 


— 


1 


ns 


— 


— 


1 


ns 


— 


30 


— 


ns 


— 


— 


1 


ns 


— 


0.4 


— 


ns 


Tsu in Figure 12-1 


2.3 


— 


ns 


Th in Figure 12-1 



Figure 12-1 DECchip 21071-DA Clock Skew Requirements 
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Table 12-4 DECchip 21071-DA Clock Skew Limits at clk1x2 Pin 



Parameter 



Example Transfers 



Maximum Unit Note 



clklx2 or pClk rising edge to 
same clock rising edge 

clklx2 or pClk falling edge to 
same clock falling edge 

clklx2 rising edge to falling edge 

clklx2 falling edge to rising edge 

clklx2 rising edge to pClk rising 

edge, 

pClk rising edge to clklx2 rising 

edge 

clklx2 falling edge to pClk 
falling edge, 

pClk falling edge to clklx2 
falling edge 

clklx2 rising edge to pClk falling 

edge, 

clklx2 falling edge to pClk rising 

edge, 

pClk rising edgetoclklx2 falling 

edge, 

pClk falling edgetoclklx2 rising 

edge 

pClk rising edge to falling edge, 
pClk falling edge to rising edge 



clklR to clklR, clklR to clklF, 
clklF to ClklR, clklF toclklF, 
pClkR topClkR 

clk2R to clk2R, clk2R to clk2F, 
clk2F toclk2R, clk2F toclk2F, 
pClkF topClkF 

ClklR toclk2R, clklR toclk2F, 
ClklF toclk2R, clklF toclk2F 

clk2R to ClklR, clk2R toclklF, 
clk2F to ClklR, clk2F toclklF 

ClklR to pClkR, ClklF to pClkR, 
pClkR to ClklR, pClkR toclklF 



clk2R to pClkF, clk2F to pClkF, 
pClkF toclk2R, pClkF toclk2F 



ClklR to pClkF, ClklF to pClkF, 
clk2R to pClkR, clk2F to pClkR, 
pClkR toclk2R, pClkR toclk2F, 
pClkF to ClklR, pClkF toclklF 



0.50 


ns 


@ Cycle 
= 30ns 


1.25 


ns 


@ Cycle 
= 30ns 


1.60 


ns 


@ Cycle 
= 30ns 


1.60 


ns 


@ Cycle 
= 30ns 


2.10 


ns 


@ Cycle 
= 30ns 


2.85 


ns 


@ Cycle 
= 30ns 


3.35 


ns 


@ Cycle 
= 30ns 



pclkR topClkF, 
pclkF topClkR 



1.75 



ns @ Cycle 
= 30 ns 
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Figure 12-2 DECchip 21071-DA Clock Signals 
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The 21071-DA imposes no requirements on cll<l or sysClkOutl. Skew on 
clkl will be constrained by limits imposed by external paths to or from the 
B cache control PALs. The phase error between sysClkOutl and clklx2 will be 
constrained by limits imposed by external paths to or from the CPU chip. 

12.2.2 Signals 

Figures 12-3 and 12-4 demonstrate the timing measurements specified in 
Tables 12-6 and 12-7. 
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Figure 12-3 DECchip 21071-DA Output Delay Measurement 
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Figure 12-4 DECcliip 21071-DA Setup and Hold Time Measurement 
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The following ac electrical data is specified with respect to the appropriate edge 
at the clklx2 or pClk pins. Both the output delay table and the setup/hold 
timetable assume a 1 ns edge rate at the clklx2 and pClk pins. 

All outputs drive a 50 pF load. When estimating module delays, you may need 
to replace the 50 pF load delay with a simulated (or calculated) delay. The 
delays for 4 mA and 8 mA drivers that drive a 50 pF load are provided in 
Table 12-5. See Table 8-1 for information about the buffer size of every output 
pin. 



Table 12- 


-5 


DECchip 21071 


-DA Output Buffer Delays 


into a 


50 pF Load 


Type 




Minimum 


lUlaximum 


Unit 






4 mA 
8 mA 




3.5 
2.3 


7.6 
5.0 


ns 
ns 
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Table 12-6 DECchip 21071-DA AC Characteristics (Valid Delay into a 50 pF Load) 
Signal Minimum Maximum Unit Reference Edge 

sysAdr<33:5> 

ioRequest<l:0>, 
ioCmd<2:0> 



4.8 


14.2 


ns 


clklR 


4.6 


11.8 


ns 


clklR 


2.0 


11.0 


ns 


pClkR 



AD<31:0>, 

CBEJ<31:0>, 

Par, 

FrameL, 

TrdyL, 

IrdyL, 

StopL, 

PerrL, 

LockL, 

DevselL 

MemAckl, 2.0 12.0 ns pClkR 

ReqL 



epiData<31:0>, 4.8 16.1 ns clklR 

epiBEnErr<3:0> 



epiSelDMA, 4.8 14.9 ns clklR 

epiFromlOB, 

epiOWSel, 

epi Lines el <1:0>, 

epiEnable<3:0>, 

ioLineSel<l:0>, 

epILInelnval 



IntHwO 7.5 20.1 ns clklF 
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Table 12-7 DECchip 21071-DA AC Characteristics (Setup/Hold Time) 



Signal 



Setup Hold 



Unit 



Reference Edge 



sysAdr<33:5> 
cpuCWM asl«7:0> 
cpuCReq<2:0> 
cpuCReq<2:0> 
cpuHoldAck 



13.1 


3.7 


ns 


clklR 


7.1 


3.8 


ns 


clklR 


0.0 


3.2 


ns 


clklF 


15.1 


0.0 


ns 


ClklR 


-0.3 


3.0 


ns 


clklF 



ioGrant, 

ioCAck<l:0>, 

ioDataRdy 



-0.3 3.2 ns ClklF 



AD<31:0>, 

CBEJ<31:0>, 

Par, 

FrameL, 

TrdyL, 

IrdyL, 

StopL, 

PerrL, 

LockL, 

DevselL, 

MemReql 

GntL 



7.0 0.0 ns pClkR 



10.0 0.0 ns pClkR 



epiData<31:0> 
epiBEnErr<3:0> 



0.4 5.2 ns clk2F 
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DECchip 21 071 -DA Power-Up and 

Initialization 



This chapter describes the behavior of the DECchip 21071-DA on power-up and 
assertion of reset_l. It also describes the system level requirements and the 
various registers that have to be initialized after reset_l is deasserted. 

13.1 Power-Up 

On power-up, the reset_l input of the DECchip 21071-DA should be asserted. 
It should be kept asserted until the system clocks are up and running for 20 
cycles. 

13.2 Internal Reset 

The assertion and deassertion of the reset_l pin on the module is asynchronous 
to the DECchip 21071-DA chip. An internal reset signal is generated from 
reset_l, which asserts asynchronously as soon as reset_l is asserted but 
deasserts synchronously. Due to the synchronous deassertion of the internal 
reset, the DECchip 21071-DA requires that no external transaction should 
start until 10 system clock cycles after the deassertion of reset_l. 

13.3 State of Pins on Reset Assertion 

The following are general rules and requirements for the behavior of DECchip 
21071-DA pins during reset: 

• All input only control signals (except the clocks and resetj) should be in 
the deasserted state as long as reset is asserted. 

• All output only signals are deasserted. 

• All bidirectional signals are tristated. 
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The exceptions to these rules are as follows: 

• sysAdr<33:5> are driven synchronously with the assertion of reset and are 
tristated as soon as reset_l deasserts (without waiting for the deassertion 
of synchronous internal reset). 

• epiData<31:0>and epiBEnErr<3:0> are driven as long as reset is asserted, 
and they continue to be driven after reset_l deassertion. 

• ReqL is tristated on the assertion of reset_l and remains tristated until the 
deassertion of reset_l. 

• If the PCI is not parked (that is, GntL is deasserted during reset) with 
the DECchip 21071-DA, then AD<31:0> and CBEJ<3:0> are tristated 
immediately on the assertion of reset_l, and Par is tristated a cycle later. 
If the PCI is parked with the DECchip 21071-DA (that is, GntL is asserted 
during reset), then AD<31:0>, CBE_I<3:0>, and Par are driven to 0. 

• memAckl is tristated on the assertion of reset_l and remains tristated until 
the deassertion of reset_l. 

Note 



I n all cases, the assertion of tristate_l overrides the assertion of reset_l. 
That is, if tristate_l is asserted during reset, all the outputs of the 
chip go to their High-Z state. If resetj is still asserted when tristate_l 
deasserts, the signals return to the normal reset state described 
previously 



13.4 Configuration after Reset Deassertion 

The following states must be initialized by software in the DECchip 21071-DA 
chip after the deassertion of reset_l. 

Diagnostic control and status register (DCSR) 

PCI base address registers 

PCI mask registers 

Translated base address registers 

Host address extension registers 

PCI master latency timer register 
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Part 




Part 1 1 1 contains five chapters that provide information about the DECchip 
21071-BA chip. The following table provides a brief description of each 
chapter: 



Chapter 



Description 



14 
15 
16 

17 

18 



Describes the DECchip 21071-BA pin signals. 

Describes the DECchip 21071-BA architecture. 

Describes the flow of data within the DECchip 21071-BA for various 
transactions on thesysBus, memory data bus, and PCI bus. 

Describes the DECchip 21071-BA electrical requirements. 

Describes the behavior of the DECchip 21071-BA chip during power-up. 
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DECchip 21071-BA Pin Descriptions 



The 21071-BA chip interfaces to three major buses: 

• sysBus 

• IMemory data bus 

• epiBus 

This chapter provides a brief description of the pin signals for the 21071-BA 
data chip followed by detailed description of the 21071-BA data chip interfaces. 
This chapter also provides pin connection tables for the 21071-BA data chips in 
different bus width modes and for each 21071-BA instance (21071-BA 0,1,2,3). 

14.1 DECchip 21071-BA Pin List 

Table 14-1 lists the pin signals grouped by function. The information in the 
Type column identifies a signal as input (I), output (O), or bidirectional (B). 
The Buffer Strength column indicates the buffer drive strength. 

All output and bidirectional pins, except pTestout, can betristated. 



DECchip 21071-BA Pin Descriptions 14-1 



Table 14-1 DECchip 21 071 -B A Pin List 



Signals 



Quantity Type 



Buffer 
Strengtii 



Function 



CPU/Bcache Interface Signals 
(66 Total) 



sysData<63:0> 



sysPar<l:0> 



64 



4 ma sysBus Data. 

In ECC mode, 
sysChecl«6:0> appears 
on sysData<38:32>, and 
memChecl«6:0> appears 
on sysData<57:63>. 

4 ma Parity pins for sysBus 

data. 



Cache/Memory Data Path Control Signals 
(13 Total) 



drvSysData 




drvSysCSR 




drvMemData 




syslORead 




sysReadOW 




subCmd<l:0> 


2 1 


sysCmd<2:0> 


3 1 


memCmd<3:l> 


3 1 



Turns on 21071-BA 
sysData<63:16> drivers. 

Turns off 21071-BA 
sysData<15:0> drivers. 

Turns on 21071-BA 
memData and memPar 
drivers. 

Selects I/O read buffer to 
sysBus. 

Selects octaword to be 
read. 

Sub-commands for 
sysBus side of tlie 
21071-BA. 

Commands for sysBus 
side of tlie 21071-BA. 

Commands for memory 
sideof ciiip. 

(continued on next page) 
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Table 14-1 (Cont.) DECchip 21071-BA Pin List 



Signals 



Buffer 
Quantity Type Strength 



Function 



epIBus Signals 
(46 Total) 



epiData<31:0> 32 

epiBEnErr<3:0> 4 



epIFromlOB 



epiSelDMA 



epiEnable<l:0> 



epiOWSel 



epi Lines el <1:0> 



ioLineSel<l:0> 



epILInelnval 



B 4 ma Interchip data for both 

DMA and I/O operations. 

B 4 ma epi Data byte enables 

for epi Bus from the 
21071-DA operations and 
error/corrected status 
for epi Bus to 21071-DA 
operations. 

I — Selects the next epi Bus 

transfer from the 
21071-DA to the data 
chip. 

I — Selects which buffer 

(I/O or DMA) will 
be transferred on the 
epi Data bus. 

I — Oualifies epi Data control 

signals and enables 
output drivers. 

I — Selects which octaword 

of the cache line will 
be transferred on the 
epi Data bus. 

I — Selects which cache line 

will be transferred on 
the epi Data bus. 

I — Selects which cache line 

should be read or written 
from the sysBus. 

I — Clears all byte valid bits 

in the current line of the 
DMA write buffer. 

(continued on next page) 
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Table 14-1 (Cont.) DECchip 21071-BA Pin List 



Signals 


Quantity 


Type 


Buffer 
Strengtii 


Function 


Memory Signals 
(33 Total) 


memData<31:0> 
memPar 


32 
1 


B 
B 


4 ma 
4 ma 


Memory data. 
Memory parity pins. 


Miscellaneous/Clock 
Signals (8 Total) 



cll<lx2 
cll<2ref 

reset_l 
test Mode 
tristatej 
pTestout 

eccMode 

wideMem 



4 ma 



Clock input. 

Pliase reference for 
cll<lx2. 

Reset. 

Test mode select. 

Tri state. 

Parametric NAND tree 
output. 

True indicates ECC 
enabled. 

True indicates 128-bit 
wide memory. 



Pin Totals 

Total signal pins: 

Total power and ground pins: 

Total pins: 



166 
42 

208 



14.2 DECchip 21071-BA Signal Descriptions 

This section provides signal descriptions of the 21071-BA data chip, the clock 
edges at which they can change, and rules about their usage during various 
transactions. 

For simplicity, sysClkOutl_h is treated as clklR. 
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Signal descriptions are grouped by function and correspond to the pin list 
provided in Section 14.1. 

Note 



The DECchip 21064 microprocessor does not use clklR, but it uses 
sysClkOutl_h to generate and sample signals. 



14.2.1 CPU/Bcache Interface Signals 

See Section 2.2.4 for descriptions of 21071-CA signals that control 21071-BA 
data chip functions. 

This section describes the CPU/Bcache signals. 

14.2.1 .1 sysData<63:0>, sysPar<1 :0> 

Signal Type: Bidirectional (21071-BA, CPU, Bcache) 
Input Sampling Clock Edge: clk2F 
Output Clock Edge: clklR 

sysData<53:0> is a bidirectional bus which provides data to and from 
the 21071-CA chip and the CPU. sysPar<l:0> are the parity bits for 
sysData<53:0>. 

The CPU is the default driver of sysData. 

When the system is configured in longword parity mode: 

• sysPar<0> is the even parity across sysData<31:0> and is connected to 
check <0> of the processor. 

• sysPar<l> is the even parity across sysData<53:32> and is connected to 
check <7> of the processor. 

When the system is configured in longword ECC mode: 

• sysData<38:32> is the ECC across sysData<31:0>and is connected to 
check <6:0> of the processor. 

• sysData<57:63> {note reversed order) is the ECC across memData<31:0>, 
and is connected to check <6:0> of the memory bus. 
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14.2.2 Cache/Memory Data Path Control 

This section describes the cache/memory data path control signals. 

14.2.2.1 drvSysData 

Signal Type: 21071-BA Input 

Signal Source: 21071-CA 

Input Sampling Clock Edge: clklR assertion, clklF deassertion 

When drvSysData is sampled asserted, the 21071-BA chips drive sysData<63:16> 
and sysData<15:0> (only if drvSysCSR is deasserted) on this clklR. 

14.2.2.2 drvSysCSR 

Signal Type: 21071-BA Input 

Signal Source: 21071-CA 

Input Sampling Clock Edge: clklR 

When drvSysCSR is asserted, the 21071-BA chips will not drive sysData<15:0> 
on the next clklR. 



14.2.2.3 drvMemData 

Signal Type: 21071-BA I nput 
Signal Source: 21071-CA 
Input Clock Edge: Flow through 

drvMemData directly controls the memData drivers on the 21071-BA chips. 
When drvMemData is asserted, memData is driven; when drvMemData is 
deasserted, memData is tristated. 



14.2.2.4 syslORead 



Signal Type: 21071-BA I nput 

Signal Source: 21071-CA 

Input Sampling Clock Edge: clk2F 

syslORead is asserted by the 21071-CA chip along with drvSysData to indicate 
that the contents of the I/O read buffer should be driven onto the sysBus. 

syslORead is used by the 21071-BA chips to drive the contents of the I/O read 
buffer onto the sysBus. 
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14.2.2.5 sysReadOW 



Signal Type: 21071-BA Input 

Signal Source: 21071-CA 

Input Sampling Clock Edge: clk2F 

sysReadOW is asserted by the 21071-CA chip to indicate to the 21071-BA chips 
that the upper octaword of data should be taken from the memory read, merge, 
and I/O read buffers. 



14.2.2.6 subCmd<1 :0> 



Signal Type: 21071-BA Input 

Signal Source: 21071-CA 

Input Sampling Clock Edge: clk2F 

The subCmd<l:0> signals are asserted to further qualify the sysCmd<2:0> 
signals, as described in Table 14-2. 

The subCmd<l:0> signals, in conjunction with sysCmd<2:0> signals, are used 
by the 21071-BA chips as commands for operations on the sysBus data buffers. 



14.2.2.7 sysCmd<2:0> 



Signal Type: 21071-BA Input 

Signal Source: 21071-CA 

Input Sampling Clock Edge: clk2F 

The sysCmd<2:0> signals, in combination with the subCmd<l:0> signals, 
indicate to the 21071-BA chip the action to take on the sysData bus. I n 
general, they echo the actions that took place on the sysBus during the 
previous cycle. The bits are decoded into various actions. Table 14-2 describes 
the sysCmd<2:0>and subCmd<l:0> encodings. 
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Table 14-2 sysCmd<2:0> and subCmd<1:0> Encodings 



sysCmd 



subCmd 



Mnemonic 



Function 



000 



000 



ox 



IX 



RESET 



NOP 



001 



010 



XX 



XX 



LOAD 



RDDMAS WRIO 



Oil 



XX 



RDDMAM 



The merge bits in tlie merge buffer 
are cleared. All sysBus counters 
are reset. The data in the pad 
latches is held (to save power). 

The data in the pad latches is held 
in the latches, and new data will 
not be clocked into them. Used 
during reads or to hold the first 
transfer of write data due to a full 
write buffer. 

No write action is performed. Sent 
when waiting for write data to be 
ready. Data from the sysData bus 
is loaded into the pad flops. 

Data in the sysData pad latches is 
loaded into the DMA read buffer, 
which also serves as the I/O write 
buffer. A counter is incremented 
so that the next RDDMAS will 
load data into the next sub-cache 
line of the buffer. 

Data in the memory read buffer is 
loaded into the DMA read buffer. 
A counter is incremented so that 
the next RDDMAM will load data 
into the next sub-cache line of the 
buffer. 

(continued on next page) 
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Table 14-2 (Cont.) sysCmd<2:0> and subCmd<1 :0> Encodings 



sysCmd 



subCmd 



Mnemonic 



Function 



100 



00 



MERGEOO 



100 



100 



100 



101 



01 



10 



11 



00 



MERGEOl 



MERGE 10 



ME RGB 11 



WRSYSO 



101 



01 



WRSYSl 



Nothing is loaded into tlie merge 
buffer. A counter is incremented 
sotliat tlie next MERGEnn will 
load data into the next sub-cache 
line of the buffer. 

During STx_C transactions that 
hit in the cache, each sub-cache 
line of the merge buffer is loaded 
twice: once with the CPU write 
data using MERGE (that is, 
M E RGEOl) and once with the 
cache data using MERGE with 
inverted enables, called an overlay 
(that is, OVLYIO). 

Same as MERGEOO, but longword 
O's data in thesysData pad latches 
is loaded into the read/merge 
buffer, and longword O's merge bit 
is set. 

Same as MERGEOO, but longword 
I's data in thesysData pad latches 
is loaded into the read/merge 
buffer, and longword I's merge bit 
is set. 

Same as MERGEOO, but longword 
and I's data in the sysData pad 
latches is loaded into the read 
/merge buffer, and longword and 
I's merge bits are set. 

Data in the sysData pad latches 
is loaded into the memory write 
buffer that represents cache line 
0. A counter is incremented so 
that the next WRSYSO will load 
data into the next sub-cache line 
of cache lineO. 

Same as WRSYSO, but for cache 
line 1. 

(continued on next page) 
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Table 14-2 (Cont.) sysCmd<2:0> and subCmd<1:0> Encodings 



sysCmd 



subCmd 



Mnemonic 



Function 



101 


10 


WR5YS2 


101 


11 


WRSYS3 


110 


00 


WRDMASO 



110 


01 


WRDMASl 


110 


10 


WRDMAS2 


110 


11 


WRDMAS3 


111 


00 


WRDMAMO 



Same as WRSY50, but for cache 
line 2. 

Same as WRSYSO, but for caclie 
lines. 



Data in tiie sysData pad iatciies 
is merged with the DMA write 
buffers and is loaded into 
the memory write buffer that 
represents cache line 0. A counter 
is incremented so that the next 
WRDMASO will load data into the 
next sub-cache line of cache line 0. 

Same as WRDMASO, but for cache 
line 1. 

Same as WRDMASO, but for cache 
line 2. 

Same as WRDMASO, but for cache 
line 3. 

Data in the memory read buffer 
is merged with the DMA write 
buffers and is loaded into 
the memory write buffer that 
represents cache line 0. A counter 
is incremented so that the next 
WRDMAMO will load data into the 
next sub-cache line of cache line 0. 



Ill 


01 


WRDMAMl 


Same as WRDMAMO, 
cache line 1. 


but for 


111 


10 


WRDMAM2 


Same as WRDMAMO, 
cache line 2. 


but for 


111 


11 


WRDMAM3 


Same as WRDMAMO, 
cache line 3. 


but for 
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14.2.2.8 memCmd<3:1> 



Signal Type: 21071-BA Input 

Signal Source: 21071-CA 

input Sampling Clock Edge: clklR 

The memCmd<3:l> signals indicate to the 21071-BA chips which action to tal<e 
on the memData bus. For the encodings of memCmd<3:l>, see Table 14-3. 



Table 14-3 memCmd<3:1> Encodings 



memCmd 



Mnemonic 



Function 



000 



001 



010 

oil 

100 



101 



RDIMM 



RDDLY 



NOP 

RESET 

WRIMM 



WRDLY 



Read data is loaded into the read/merge buffer on 
the next memClkR. A counter is incremented so 
that the next RDxxx will load data into the next 
available sub-cache line of the read buffer. 

Read data is loaded into the read/merge buffer on 
the memClkR after the next memClkR. A counter 
is incremented so that the next RDxxx will load 
data into the next available sub-cache line of the 
read buffer. 

No operation. 

All memory counters are reset. 

Data from the memory write buffer is driven 
to memory on the next memClkR. A counter is 
incremented so that the next WRxxx will drive the 
next sub-cache line to memory. 

Data from the memory write buffer is driven to 
memory on the memClkR after the next memClkR. 
A counter is incremented so that the next WRxxx 
will drive the next sub-cache line to memory. 



110 



WRIMML 



Data from the memory write buffer is driven to 
memory on the next memClkR. After the write, 
the quadword pointer is reset to 0, and the cache 
line pointer is incremented so that the next WRxxx 
will drive the first sub-cache line of the next line 
to memory. 

(continued on next page) 
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Table 14-3 (Cont.) memCmd<3:1> Encodings 



memCmd Mnemonic Function 



111 WRDLYL Data from the memory write buffer is driven to 

memory on tine memCII<R after tine next memCII<R. 
After the write, the quadword pointer is reset to 0, 
and the cache line pointer is incremented so that 
the next WRxxx will drive the first sub-cache line 
of the next line to memory. 
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14.2.3 epiBus Signals 

This section describes the epiBus signals. 

14.2.3.1 epiData<31 :0> 

Signal Type: Bidirectional (21071-BA, 21071-DA) 

Output Clock Edge: clklR 

input Sampling Clock Edge: clk2F 

epIData is a 32-bit bidirectional bus which connects the 21071-DA and the 
21071-BA chips. 

14.2.3.2 epiBEnErr<3:0> 

Signal Type: Bidirectional (21071-BA, 21071-DA) 

Output Clock Edge: clklR 

input Sampling Clock Edge: clk2F 

epiBEnErr<3:0> is timed with epiData. During epiBus transfers from the 
21071-DA chip to the 21071-BA chips, this field indicates which bytes of the 
longword on the epiData bus are valid. When an epiBEnErr<3:0> bit is set 
(high), the corresponding byte is valid. The byte enable is used for DMA write 
transfers and ignored on I/O read transfers. 

During epiBus transfers from the 21071-BA data chips to the 21071-DA chip, 
epiBEnErr<0> is asserted if the longword being sent on epiData contains 
a parity error or uncorrectable ECC error. epiBEnErr<l> is asserted if the 
longword being sent on epiData contained a correctable ECC error. Table 14-4 
lists theeplBEnErr functions. 



Table 14-4 epiBEnErr Functions 



Signal 



Transfers to 21071-BA 



Transfers from 21071-BA 



epiBEnErr<0> 



epi Data <7:0> byte enable 



epi B E nE rr<l> epi Data<15:8> byte enable 

epiBEnErr^> epiData<23:16> byte enable 

epiBEnErr<3> epi Data<31:24> byte enable 



DMA read or I/O write 
uncorrectable error (this 
longword) 

DMA read or I/O write 
corrected error (this longword) 

Reserved 

Reserved 
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14.2.3.3 epiFromlOB 



Signal Type: 21071-BA Input 

Signal Source: 21071-DA 

Input Sampling Clock Edge: clk2F 

epiFromlOB indicates the direction of epiData to the 21071-BA chips. When 
epiFromlOB is deasserted, only the 21071-BA chip selected with epiEnable 
drives epiData<31:0>and epiBEnErr<3:0>. When epiFromlOB is asserted, the 
21071-BA chips receive data on epiData<31:0>and epiBEnErr<3:0>. 



14.2.3.4 epiSelDMA 



Signal Type: 21071-BA I nput 

Signal Source: 21071-DA 

Input Sampling Clock Edge: clk2F 

epiSelDMA is used by the 21071-BA chips when epiFromlOB is asserted, 
to determine whether the destination of epiData is the DMA write buffer 
(epiSelDMA =high) or the I/O read buffer (epiSelDMA =low). 

14.2.3.5 epiEnable<1:0> 

Signal Type: 21071-BA Input 

Signal Source: 21071-DA 

Input Sampling Clock Edge: clk2F 

The epiEnable<l:0> signals are asserted by the 21071-DA chip to the 21071-BA 
chip to indicate that the 21071-DA is performing an epiBus transfer. When 
epiEnable is driven low, the epiData and epiBus control signals are ignored. 

epiEnable is used to determine which longword within the octaword has to be 
driven onto and received from the epiData bus in the following cycle. 

The command is always sent 1 cycle prior to the corresponding data. 
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Table 14-5 indicates the function performed by the 21071-BA chips based on 
the values of epIEnable, epIFromlOB, and epiSelDMA. 

Table 14-5 21071-BA epiBus Interface Function 



epiEnable 


epiFromlOB 


epiSelDMA 


Function 





X 


X 


No action, except for possible line 
invalidate; epiData is tri stated. 


1 





X 


The DMA read or I/O write buffer is 
driven onto epiData. 


1 


1 





epiData is loaded into the I/O read 
buffer. 


1 


1 


1 


epiData is loaded into the DMA write 
buffer. 



14.2.3.6 epiOWSel 

Signal Type: 21071-BA Input 

Signal Source: 21071-DA 

Input Sampling Clock Edge: clk2F 

epiOWSel is used by the 21071-BA chips to select the octaword within the 
cache line that has to be written or read using the epiData bus. When 
epiOWSel is 0, the lower octaword is selected. When epiOWSel is 1, the 
upper octaword is selected. 

14.2.3.7 epiLineSel<1:0> 

Signal Type: 21071-BA Input 

Signal Source: 21071-DA 

Input Sampling Clock Edge: clk2F 

epiLineSel<l:0>is used to select the cache line of the DMA read or I/O write 
buffer that has to be read onto the epiBus. epiLineSel<l:0> is also used to 
select the cache line of the DMA write buffer to be loaded from the epiBus. 
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14.2.3.8 ioLineSel<1:0> 



Signal Type: 21071-BA Input 

Signal Source: 21071-DA 

Input Sampling Clock Edge: clk2F 

ioLineSel<l:0> is used to select the cache line of the DMA read or I/O write 
buffer that has to be loaded from the sysBus. 



14.2.3.9 epiLinelnval 

Signal Type: 21071-BA Input 

Signal Source: 21071-DA 

Input Sampling Clock Edge: clk2F 

When epiLinelnval is asserted, all byte enables for the selected cache line will 
be cleared in the DMA write buffer. 

14.2.4 Memory Signals 

This section describes the memory signals. 

14.2.4.1 memData<31 :0>, memPar<0> 

Signal Type: Bidirectional (21071-BA, Memory) 
Input Sampling Clock Edge: memClkR 
Output Clock Edge: memClkR 

memData<31:0> is a bidirectional bus which provides data to and from the 
21071-BA chip and memory. memPar<0> is the corresponding parity bit. 

The 21071-BA chip is the default driver of memData<31:0>. memData<31:0> 
is driven during all transactions except memory reads. During reads, 
memData<31:0> is tristated on memClkR. The 21071-CA chip controls the 
turn-on and turn-off of the memData bus with drvMemData. The timing for 
driving out write data or latching in read data is controlled by the 21071-CA 
chip using memCmd<3:l>. 
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14.2.5 Miscellaneous/Clock Signals 

This section describes the miscellaneous and clock signals. 

14.2.5.1 Clk1x2 

Signal Type: 21071-BA Input 
Signal Source: Clock generator 

clklx2 is a clock input which supplies a clock at twice the frequency of the 
DECchip 21064 sysClkOutl, with a minimum period of 15 ns and a 50% duty 
cycle. 

14.2.5.2 clk2ref 

Signal Type: 21071-BA Input 
Signal Source: Clock generator 

clk2ref is a signal input which is low when the assertion of clklx2 corresponds 
to the assertion of sysClkOutl. The received signal must be setup to the 
assertion of elk 1x2. 

14.2.5.3 resetj 

Signal Type: 21071-BA Input 
Signal Source: External logic 

Input Clock Edge: Asynchronous on assertion, clklR on deassertion 

Assertion of reset_l sets all internal logic and state machines to their initialized 
states. 

14.2.5.4 testMode 

Signal Type: 21071-BA Input 

Signal Source: Test logic 

Input Clock Edge: Asynchronous 

Assertion of testMode places the chip into a mode for chip testing. testMode 
is intended to be used only during chip testing, and it must be tied low during 
normal system operation. 

testMode has a weak internal pull-down and a Schmitt trigger input. 
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14.2.5.5 tristatej 

Signal Type: 21071-BA Input 
Signal Source: External logic 
Input Clock Edge: Asynchronous 

Assertion of this signal tristates all output and bidirectional drivers. tristate_l 
Is Intended for use during chip testing and power-up. 

trlstate_l has a weak Internal pull-up and a Schmitt trigger Input. 

14.2.5.6 pTestout 

Signal Type: 21071-BA Output 
Signal Source: Test logic 
Output Clock Edge: Flow through 

The pTestout signal contains the output from the Parametric NAND tree, as 
required for testability. The testMode signal must be asserted for pTestout to 
be valid. pTestout Is Intended for use only during chip testing. 

14.2.5.7 eccMode 

Signal Type: 21071-BA 
Input Clock Edge: Static 

The eccMode signal Is an input to the 21071-BA chip which Indicates the 
type of error-checl<ing used on the module. eccMode tied high indicates that 
the 21071-BA chip must use the 7-blt ECC code used by the DECchip 21064; 
eccMode tied low Indicates that the 21071-BA chip must use longword parity 
checl<ing. See Section 15.2.6 for a description of how and when the 21071-BA 
chip performs data checl<s and corrections. 
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eccMode should be used only in conjunction with a 128-bit memory data bus 
(using four 21071-BA chips). 

Caution 



eccM ode tied high with wideMem tied low will result in UNDEFINED 
behavior and may cause damage to system hardware. 



eccMode has a weak internal pull-down and a Schmitt trigger input buffer. 
Note 



Changing eccMode after reset is deasserted may result in UNDEFINED 
behavior. 



14.2.5.8 wideMem 



Signal Type: 21071-BA Input 
Input Clock Edge: Static 

The wideMem signal is an input to the 21071-BA chip that indicates the width 
of the memory data bus. wideMem tied high indicates a 128-bit wide memory 
data bus (DECchip 21072 ); wideMem tied low indicates a 64-bit wide memory 
data bus (DECchip 21071 ). 

wideMem has a weal< internal pull-down and a Schmitt trigger input buffer. 

Note 

Changing wideMem after reset is deasserted may result in UNDEFINED 
behavior. 
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14.3 DECchip 21071-BA Pin Connection Table 

This section includes DECchip 21071-BA pin connection tables. 

Table 14-6 DECchip 21071-BA Pin Assignments for DECchip 21072 with 
Parity 







Module Trace Name 




21071-BA 


21071-BA 


21071-BA 


21071-BA 


21071-BA 


Pin Name 


Chip #3 


Cliip #2 


Cliip #1 


Cliip #0 


eccMode 


vss 


vss 


vss 


vss 


wideMem 


vcc 


vcc 


vcc 


vcc 


epiBEnErr<3:0> 


epiBEnErr<3:0> 


epiBEnErr<3:0> 


epiBEnErr<3:0> 


epiBEnErr<3:0> 


epiData<31:0> 


epiData<31:0> 


epiData<31:0> 


epiData<31:0> 


epiData<31:0> 


epiEnable<l> 


vss 


vss 


vss 


vss 


epiEnable<0> 


epiEnable<3> 


epiEnable<2> 


epiEnable<L> 


epiEnable<0> 



mennData<31:0> nnemData<127:96> nnennData<95:64> memData<63:32> mennData<31:0> 
mennPar<0> mennPar<3> mennPar<2> mennPar<L> mennPar<0> 



drvSysCSR 

drvSysData 

subCnnd<L> 

subCmd<0> 

sysData<63:32> 

sysData<31:0> 

sysPar<l> 

sysPar<0> 



vss 

drvSysData 

subCmdCommon 

subCmdB<l> 
1 

sysData<127:96> 
1 

sysChecl«21> 



vss 

drvSysData 

subCmdCommon 

subCmdA<l> 

_1 

sysData<95:64> 
_i 

sysCheck<L4> 



vss 

drvSysData 

subCmdCommon 

subCmdB<0> 

_1 

sysData<63:32> 
1 

sysCheck<7> 



drvSysCSR 

drvSysData 

subCmdCommon 

subCmdA<0> 
_1 

sysData<31:0> 
1 

sysCheck<0> 



^Tie off to vcc or vss with resistor 
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Table 14-7 DECchip Pin Assignments for DECchip 21072 with ECC 







lUlodule Trace Name 




21071-BA 


21071-BA 


21071-BA 


21071-BA 


21071-BA 


Pin Name 


Chip #3 


Cliip #2 


Chiip #1 


Chip #0 


eccM ode 


vcc 


vcc 


vcc 


MCC 


wideMem 


vcc 


vcc 


vcc 


VCC 


epiBEnErr<3:0> 


epiBEnErr<3:0> 


epiBEnErr<3:0> 


epiBEnErr<3:0> 


epiBEnErr<3:0> 


epiData<31:0> 


epiData<31:0> 


epiData<31:0> 


epiData<31:0> 


epiData<31:0> 


epiEnable<L> 


vss 


vss 


vss 


vss 


epiEnable<0> 


epiEnable<3> 


epiEnable<2> 


epiEnable<l> 


epiEnable<0> 


memData<31:0> 


memData<127;96> 


memData<95:64> 


memData<63:32> 


memData<31:0> 


memPar<0> 


N/C2 


N/C^ 


N/C2 


N/C2 


drvSysCSR 


vss 


vss 


vss 


drvSysCSR 


drvSysData 


drvSysData 


drvSysData 


drvSysData 


drvSysData 


subCmd<l> 


subCmdCommon 


subCmdCommon 


subCmdCommon 


subCmdCommon 


subCmd<0> 


subCmdB<l> 


subCmdA<l> 


subCmdB<0> 


subCmdA<0> 


sysData<63:57> 


memChecl«21:27> 


memCheck<14:20> 


memC h eck <7 : 13> 


memCheck<0:6> 


sysData<56:39> 


_i 


_i 


_i 


_i 


sysData<38:32> 


sysChecl«27:21> 


sysChecl«20:14> 


sysCheck<L3:7> 


sysCheck<6:0> 


sysData<31:0> 


sysData<L27:96> 


sysData<95:64> 


sysData<63:32 


sysData<31:0> 


sysPar<L:0> 


_i 


_i 


_i 


_i 


^Tie off to vcc or vss with resistor 








^N/C =not connected 
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Table 14-8 DECchip 21071-BA Pin Assignments for DECchip 21071 With 
Parity^ 



21071-BA 
Pin Name 


21071-BA 
Chip #1 


Module Trace Name 

21071-BA 
Cliip #0 



eccMode 
wideMem 



vss 
vss 



vss 
vss 



epiBEnErr<3:0> epiBEnErr<3:0> epiBEnErr<3:0> 

epiData<31:0> epiData<31:0> epiData<31:0> 

epiEnable<l> epiEnable<3> epiEnable<2> 

epiEnable<0> epiEnable<l> epiEnable<0> 

memData<31:0> memData<63:32> memData<31:0> 

memPar<0> memPar<l> manPar<0> 



drvSysCSR 

drvSysData 

subCmd<l> 

subCmd<0> 

sysData<63:32> 

sysData<31:0> 

sysPar<l> 

sysPar<0> 



vss 

drvSysData 

subCmdB<l> 

subCmdB<0> 

sysData<127:96> 

sysData<63:32> 

sysChecl«21> 

sysChecl«7> 



drvSysCSR 

drvSysData 

subCmdA<l> 

subCmdA<0> 

sysData<95:64> 

sysData<31:0> 

sysChecl«14> 

sysChecl«0> 



14.4 DECchip 21071-BA Pin Assignment 

The DECchip 21071-BA is a 208-pin plastic quad flat pack (PQFP). Figure 14-1 
shows the signal assignments. Sections 14.4.1 and 14.4.2 provide alphabetical 
and numerical pin listings. 



■^ DECchip 21071-BA does not support ECC with 64-bit memory. 
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Figure 14-1 DECchip 21071-BA Pinout Diagram 
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14.4.1 DECchip 21071-BA Alphabetical Pin Assignment List 

Table 14-9 lists the DECchip 21071-BA pins in alphabetical order. The 
following list describes the abbreviations used in the Type column of the table. 

• B = Bidirectional 

• I = Input 

• P = Power 

• 0= Output 



Table 14-9 Alphabetical Pin Assignment List 



Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



clklx2 


133 




clk2ref 


135 




drvMemData 


63 




drvSysCSR 


74 




drvSysData 


73 




eccMode 


9 




epiBEnErrO> 


198 


B 


epiBEnErr<l> 


197 


B 


epiBEnErr-^> 


196 


B 


epiBEnErr<3> 


195 


B 


epiDataO> 


194 


B 


epiData<I> 


192 


B 


epiData'^> 


191 


B 


epiData<3> 


190 


B 


epiData<4> 


189 


B 


epiData<2> 


188 


B 


epi Data<6> 


187 


B 


epi Data'^> 


186 


B 


epi Data'^> 


185 


B 


epiData<^> 


184 


B 


epiData<10> 


181 


B 


epi Data <11> 


180 


B 


epi Data '^2> 


179 


B 


epi Data <^3> 


178 


B 


epi Data '^4> 


177 


B 


epiData<^5> 


176 


B 



epi Data<16> 


175 


epiData<17> 


174 


epiData<18> 


173 


epiData<19> 


171 


epi Data <^0> 


170 


epi Data <^1> 


169 


epi Data <22> 


168 


epi Data <23> 


167 


epiData<24> 


166 


epiData<25> 


165 


epi Data <26> 


164 


epi Data <27> 


163 


epi Data <28> 


162 


epiData<29> 


161 


epiData<30> 


160 


epi Data <31> 


159 


epiEnable^> 


3 


epiEnable^> 


4 


epiFromiOB 


204 


epiLineinval 


203 


epiLineSelO> 


201 


epiLineSel<^> 


200 


epiOWSel 


199 


epiSelDI^IA 


202 


inpVdd 


51 


inpVdd 


103 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



inpVdd 


134 


P 


inpVdd 


155 


P 


inpVdd 


207 


P 


inpVss 


11 


P 


inpVss 


49 


P 


inpVss 


52 


P 


inpVss 


104 


P 


inpVss 


132 


P 


inpVss 


156 


P 


inpVss 


208 


P 


ioLine5elO> 


80 




ioLineSel<l> 


77 




memCmd<L> 


64 




memCmd-^> 


65 




memCmd<3> 


66 




memData<^> 


12 


B 


memData<L> 


13 


B 


memData'^> 


14 


B 


memData<3> 


15 


B 


memData<4> 


17 


B 


memData<2> 


18 


B 


memData<a6> 


19 


B 


memData^> 


20 


B 


memData^> 


21 


B 


memData'^> 


22 


B 


memData<10> 


23 


B 


memData<^l> 


24 


B 


memData<12> 


25 


B 


memData<13> 


28 


B 



memData^4> 


29 


B 


memData<a5> 


30 


B 


memData<^6> 


31 


B 


memData<17> 


32 


B 


memData^£> 


33 


B 


memData^9> 


34 


B 


memData'^0> 


35 


B 


memData<21> 


36 


B 


memData<22> 


38 


B 


memData<23> 


39 


B 


memData^4> 


40 


B 


memData'^5> 


41 


B 


memData^6> 


42 


B 


memData<27> 


43 


B 


memData<28> 


44 


B 


memData-^9> 


45 


B 


memData<30> 


46 


B 


memData<31> 


47 


B 


memPar<0> 


48 


B 


outVdd 


2 


P 


outVdd 


27 


P 


outVdd 


50 


P 


outVdd 


54 


P 


outVdd 


79 


P 


outVdd 


102 


P 


outVdd 


106 


P 


outVdd 


131 


P 


outVdd 


153 


P 


outVdd 


158 


P 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



outVdd 


183 


P 


outVdd 


205 


P 


outVss 


1 


P 


outVss 


16 


P 


outVss 


26 


P 


outVss 


37 


P 


outVss 


53 


P 


outVss 


68 


P 


outVss 


78 


P 


outVss 


105 


P 


outVss 


89 


P 


outVss 


120 


P 


outVss 


130 


P 


outVss 


141 


P 


outVss 


154 


P 


outVss 


157 


P 


outVss 


172 


P 


outVss 


182 


P 


outVss 


193 


P 


outVss 


206 


P 


pTestout 


5 





reset_l 


8 


1 


sysCmdO> 


69 


1 


sysCmd<l> 


70 


1 


sysCmd<2> 


71 


1 


sysDataO> 


152 


B 


sysData<l> 


151 


B 


sysData<2> 


150 


B 


sysData<3> 


149 


B 


sysData<4> 


148 


B 



sysData^> 


147 


B 


sysData<6> 


146 


B 


sysData<^> 


145 


B 


sysData<^> 


144 


B 


sysData<^> 


143 


B 


sysData<10> 


142 


B 


sysData<ll> 


140 


B 


sysData<L2> 


139 


B 


sysData<^3> 


138 


B 


sysData<a4> 


137 


B 


sysData<L5> 


136 


B 


sysData<16> 


129 


B 


sysData<^7> 


128 


B 


sysData<^S> 


127 


B 


sysData<^9> 


126 


B 


sysData<20> 


125 


B 


sysData<21> 


124 


B 


sysData<22> 


123 


B 


sysData<23> 


17? 


B 


sysData<24> 


121 


B 


sysData<25> 


119 


B 


sysData<26> 


118 


B 


sysData<27> 


117 


B 


sysData<28> 


116 


B 


sysData<29> 


115 


B 


sysData<30> 


114 


B 


sysData<31> 


113 


B 


sysData<32> 


111 


B 


sysData<33> 


110 


B 


sysData<34> 


109 


B 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



sysData<35> 


108 


B 


sysData<36> 


107 


B 


sysData<37> 


101 


B 


sysData<38> 


100 


B 


sysData<39> 


99 


B 


sysData<40> 


98 


B 


sysData<41> 


97 


B 


sysData<42> 


96 


B 


sysData<43> 


95 


B 


sysData<a4> 


94 


B 


sysData<45> 


93 


B 


sysData<46> 


92 


B 


sysData<47> 


91 


B 


5ysData<48> 


90 


B 


5ysData<49> 


88 


B 


sysData<20> 


87 


B 


sysData'^l> 


86 


B 


sysData<22> 


85 


B 


sysData<23> 


84 


B 


sysData'^4> 


83 


B 


sysData-^5> 


82 


B 


sysData-^6> 


81 


B 


sysData<27> 


62 


B 


sysData<28> 


61 


B 


sysData<29> 


60 


B 



sysData<60> 


59 


sysData<a61> 


58 


sysData<62> 


57 


sysData<63> 


56 


syslORead 


67 


sysParO> 


112 


sysPar<l> 


55 


sysReadOW 


72 


subCmdO> 


75 


subCmd-^> 


76 


testMode 


7 


tri state 1 


6 


wideMem 


10 
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14.4.2 DECchip 21071-BA Numerical Pin Assignment List 

Table 14-10 lists the DECchip 21071-BA pins in numerical order. The 
following list describes the abbreviations used in the Type column of the 
table. 

• B = Bidirectional 

• I = Input 

• P = Power 

• 0= Output 



Table 14-10 DECchip 21071-BA Numerical Pin Assignment List 



Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



outVss 


1 


P 


outVdd 


2 


P 


epiEnableO> 


3 


1 


epiEnable<L> 


4 


1 


pTestout 


5 





tri state 1 


6 


1 


testMode 


7 


1 


reset 1 


8 


1 


eccMode 


9 


1 


wideMem 


10 


1 


inpVss 


11 


P 


memDataO> 


12 


B 


memData<l> 


13 


B 


memData<2> 


14 


B 


memData<3> 


15 


B 


outVss 


16 


P 


memData<4> 


17 


B 


memData'<2> 


18 


B 


memData<a6> 


19 


B 


memData<^> 


20 


B 


memData-^> 


21 


B 


memData<^> 


22 


B 


memData'^0> 


23 


B 


memData^l> 


24 


B 


memData<12> 


25 


B 



outVss 


26 


P 


outVdd 


27 


P 


memData<^3> 


28 


B 


memData<14> 


29 


B 


memData<15> 


30 


B 


memData<L6> 


31 


B 


memData<^7> 


32 


B 


memData-^S> 


33 


B 


memData<^9> 


34 


B 


memData<20> 


35 


B 


memData'^l> 


36 


B 


outVss 


37 


P 


memData-^2> 


38 


B 


memData<23> 


39 


B 


memData<24> 


40 


B 


memData'^5> 


41 


B 


memData<26> 


42 


B 


memData<27> 


43 


B 


memData<28> 


44 


B 


memData<29> 


45 


B 


memData<30> 


46 


B 


memData<31> 


47 


B 


memPar<0> 


48 


B 


inpVss 


49 


P 


outVdd 


50 


P 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



inpVdd 


51 


P 


inpVss 


52 


P 


outVss 


53 


P 


outVdd 


54 


P 


sysPar<l> 


55 


B 


sysData<63> 


56 


B 


sysData<62> 


57 


B 


sysData<a61> 


58 


B 


sysData<a60> 


59 


B 


sysData<29> 


60 


B 


sysData<^8> 


61 


B 


sysData'^7> 


62 


B 


drvMemData 


63 




memCmd-^> 


64 




memCmd<a> 


65 




memCmd<3> 


66 




syslORead 


67 




outVss 


68 


P 


sysCmdO> 


69 




sysCmd<l> 


70 




sysCmd-^> 


71 




sysReadOW 


72 




drvSysData 


73 




drvSysCSR 


74 




subCmdO> 


75 




subCmd<l> 


76 




ioLineSel<I> 


77 




outVss 


78 


P 



outVdd 


79 


P 


ioLineSel<0> 


80 


1 


sysData<26> 


81 


B 


sysData<25> 


82 


B 


sysData<24> 


83 


B 


sysData<23> 


84 


B 


sysData<22> 


85 


B 


sysData<21> 


86 


B 


sysData<20> 


87 


B 


sysData<49> 


88 


B 


outVss 


89 


P 


sysData<48> 


90 


B 


sysData<47> 


91 


B 


sysData<46> 


92 


B 


sysData<45> 


93 


B 


sysData<44> 


94 


B 


sysData<43> 


95 


B 


sysData<42> 


96 


B 


sysData<41> 


97 


B 


sysData<aO> 


98 


B 


sysData<39> 


99 


B 


sysData<38> 


100 


B 


sysData<37> 


101 


B 


outVdd 


102 


P 


inpVdd 


103 


P 


inpVss 


104 


P 


outVss 


105 


P 


outVdd 


106 


P 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



5ysData<36> 


107 


B 


sysData<35> 


108 


B 


sysData<34> 


109 


B 


sysData<33> 


110 


B 


sysData<32> 


111 


B 


sysPar<0> 


n? 


B 


sysData<31> 


113 


B 


sysData<30> 


114 


B 


sysData<29> 


115 


B 


sysData<28> 


116 


B 


sysData<27> 


117 


B 


sysData<26> 


118 


B 


sysData<25> 


119 


B 


outVss 


120 


P 


sysData<24> 


121 


B 


sysData<^3> 


17? 


B 


sysData<^2> 


17^ 


B 


sysData<^l> 


124 


B 


sysData<^0> 


17S 


B 


sysData<19> 


126 


B 


sysData<18> 


127 


B 


sysData<^7> 


128 


B 


sysData'^6> 


129 


B 


outVss 


130 


P 


outVdd 


131 


P 


inpVss 


132 


P 


clklx2 


133 


i 


inpVdd 


134 


P 


clk2Ref 


I^S 


1 


sysData<L5> 


136 


B 


sysData<^4> 


137 


B 



sysData<13> 


138 


B 


sysData<12> 


139 


B 


sysData<ll> 


140 


B 


outVss 


141 


P 


sysData<10> 


142 


B 


sysData<^> 


143 


B 


sysData'<^> 


144 


B 


sysData<^> 


145 


B 


sysData<a6> 


146 


B 


sysData<2> 


147 


B 


sysData<4> 


148 


B 


sysData<3> 


149 


B 


sysData<2> 


150 


B 


sysData<l> 


151 


B 


sysDataO> 


152 


B 


outVdd 


153 


P 


outVss 


154 


P 


inpVdd 


155 


P 


inpVss 


156 


P 


outVss 


157 


P 


outVdd 


158 


P 


epiData<31> 


159 


B 


epiData<30> 


160 


B 


epiData<29> 


161 


B 


epiData<28> 


162 


B 


epiData<27> 


163 


B 


epiData'^6> 


164 


B 


epiData<25> 


165 


B 


epiData<24> 


166 


B 


epiData^3> 


167 


B 


epiData'^2> 


168 


B 
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Pin Name 



Pin 



Type 



Pin Name 



Pin 



Type 



epiData<21> 


169 


B 


eplData<20> 


170 


B 


epiData<19> 


171 


B 


outVss 


172 


P 


epiData<18> 


173 


B 


epiData^7> 


174 


B 


epiData'^6> 


175 


B 


epiData<^5> 


176 


B 


epiData<a4> 


177 


B 


epiData<^3> 


178 


B 


epiData<L2> 


179 


B 


epiData<^l> 


180 


B 


epiData<LO> 


181 


B 


outVss 


182 


P 


outVdd 


183 


P 


epiData<^> 


184 


B 


epiData'<^> 


185 


B 


epiData'^> 


186 


B 


epiData<6> 


187 


B 


epiData<2> 


188 


B 


epiData<4> 


189 


B 


epiData<3> 


190 


B 


epiData<2> 


191 


B 


epiData<l> 


192 


B 


outVss 


193 


P 


epiData<0> 


194 


B 


epiBEnErr<3> 


195 


B 



epiBEnErr<2> 


196 


B 


epiBEnErr<^> 


197 


B 


epiBEnErrO> 


198 


B 


epiOWSel 


199 




epiLineSel<l> 


200 




epiLineSelO> 


201 




epiSelDMA 


202 




epiLinelnval 


203 




epiFromlOB 


204 




outVdd 


205 


P 


outVss 


206 


P 


inpVdd 


207 


P 


inpVss 


208 


P 



14.5 DECchip 21071-BA Mechanical Specifications 

Figure 14-2 shows the package dimensions for the DECchip 21071-BA chip. 
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Figure 14-2 DECchip 21071-BA Package Dimensions 
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DECchip 21071-BA Architecture Overview 



This chapter describes the architecture of the DECchip 21071-BA chip. 
Figure 15-1 shows a block diagram of the DECchip 21071-BA chip. 

Figure 15-1 DECchip 21071-BA Block Diagram 
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15.1 Bus Widths 

This section describes tine bus widtlis. 

15.1.1 sysData Bus 

Each 21071-BA data chip has 64 pins for thesysData bus. The DECchip 21071 
and DECchip 21072 configured systems support only a 128-bit wide sysData 
bus; a 64-bit wide sysData bus is not supported. 

In a DECchip 21071 configuration, sysData pins on each 21071-BA chip are 
connected to the sysBus: 

• The lower 21071-BA data chip (#D) connects to sysData<31:0> (longword 0) 
and sysData<95:64> (longword 2). 

• The upper 21071-BA data chip (#1) connects to sysData<63:32> (longword 
1) and sysData<127:96> (longword 3). 

In a DECchip 21072 configuration, only the lower 32-bits of the sysData bus of 
each 21071-BA chip are used: 

• 21071-BA data chip M connects to longword (sysData<31:0>). 

• 21071-BA data chip #1 connects to longword 1 (sysData<63:32>). 

• 21071-BA data chip 4S. connects to longword 2 (sysData<95:64>). 

• 21071-BA data chip 4B connects to longword 3 (sysData<L27:96>). 

15.1.2 memData Bus 

The number of 21071-BA data chips used in a system depends on the width of 
the memData bus. If the width of the memData bus is 64-bits, two 21071-BA 
data chips are required (DECchip 21071 ). If the width of the memData bus is 
128-bits, four 21071-BA chips are required (DECchip 21072 ). 

Each 21071-BA data chip connects to 32 bits of memData. I n all systems: 

• 21071-BA #D connects to longword (memData<31:0>). 

• 21071-BA #1 connects to longword 1 (memData<53:32>). 
I n a 4-chip configured system: 

• 21071-BA 4S. connects to longword 2 (memData<95:64>). 

• 21071-BA *3 connects to longword 3 (memData<127:64>). 
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Each 21071-BA data chip needs to know the width of the memData bus for 
proper operation. This is obtained from the widelMem pin. The 21071-BA 
data chips do not need to l<now which longword they are connected to. The 
proper latching and driving of data is achieved by appropriately connecting the 
21071-CA and 21071-DA command signals (Section 14.3). 

15.1.3 epiData Bus 

Each 21071-BA data chip has 32 epiData pins. 

The epiData pins of all the 21071-BA data chips are tied together to form a 
32-bit wide epiData bus. 

15.2 Description of DECchip 21071-BA Architecture 

This section describes the DECchip 21071-BA architecture. 

15.2.1 Memory Read Buffer 

The memory read buffer is also used to store data that is read from memory 
before it is returned to the CPU on the sysBus or to DMA in the DMA read 
buffer. 

Each 21071-BA data chip stores four longwords worth of data and 
corresponding check bits in the memory read buffer. 

• I n a two 21071-BA data chip designed system, the total storage is eight 
longwords or a cache line. 

• A four 21071-BA data chip designed system contains an additional eight 
longwords of storage; however, this extra storage is not usable. 

15.2.2 I/O Read Buffer and Merge Buffer 

On CPU-initiated memory transactions, this buffer performs the merge buffer 
functions described in Section 3.1.7. On CPU-initiated I/O reads addressed 
to or through the 21071-DA chip, this buffer acts as the I/O read buffer. The 
loading of data into this buffer is therefore controlled by both the 21071-CA 
and 21071-DA chips. 

Each 21071-BA data chip contains four longwords of data and corresponding 
check bits. The check bits are meaningful only for merge data. The check bits 
are UNPREDICTABLE for I/O read data. 
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15.2.3 I/O Write Buffer and DMA Read Buffer 

This buffer can store up to four entries of data. Each entry has four longwords 
per 21071-BA data chip. Data from this buffer is sent out on the epiData bus. 
System designers may choose to allocate each entry of this buffer according to 
their needs. The 21071-DA chip may use the full cache line available in each 
entry. 

I n the 21071 or 21072 implementation, two entries of this buffer are allocated 
for I/O write data storage, and two entries are allocated for DMA read data 
storage. 

I n a two 21071-BA chip system, storing one cache line uses all four longwords 
of each DMA read buffer entry; in a four 21071-BA chip system it uses only 
two of the four longwords of each entry, but the extra storage is not accessible. 

The loading of each entry can be controlled separately, thus allowing maximum 
flexibility in allocating the buffer entries to the 21071-DA. 

The loading of this buffer is handled by the 21071-CA chip, with the address 
provided by the 21071-DA on ioLineSel<l:0>. The 21071-DA chip controls 
unloading of this buffer. 

15.2.4 DMA Write Buffer 

The DMA write buffer has four entries. Each entry contains four longwords 
per 21071-BA and corresponding byte masks. In a four 21071-BA data chip 
system, only half the storage per entry is used. The extra storage is not 
accessible. 

The DMA write buffer is loaded by the 21071-DA chip and is unloaded by the 
21071-CA chip during a DMA write transaction on the sysBus. The byte masks 
are used to merge the valid bytes of data written in the DMA write buffer 
with the background data from the cache line. The background data may be 
obtained from the Bcache or memory. 

15.2.5 Memory Write Buffer 

The memory write buffer has four entries. Each entry contains four longwords 
of data per 21071-BA and corresponding check bits. The memory write buffer 
is loaded by the 21071-CA sysBus interface and is unloaded by the 21071-CA 
memory controller. 
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15.2.6 Error Checking/Correction 

The 21071-BA chip performs error checking/correction (ECC) on DMA 
transactions. When memory or Bcache data is read because of a DIM A 
transaction (DIMA read or a DIMA write masl<ed), the data is checl<ed for 
parity/ECC errors. 

If ECC is enabled, and the Bcache/memory data contains a correctable error, 
the 21071-BA data chip sends corrected data to its destination (DMA read 
buffer for DMA reads, memory write buffer for MUXing with DMA write data 
for DMA writes). 

If the data contains an uncorrectable error (dual-bit ECC error or any parity 
error), then the 21071-DA is notified (for a DMA read), or bad ECC/parity is 
written bacl< into memory (for a DMA write). 

I n case of a DMA write masl<ed transaction, parity/ECC is calculated for the 
merged data going into the memory write buffer. 

The 21071-BA data chip uses the same ECC code as the DECchip 21064 
microprocessor. See the DECchip 21064 H ardware Rd'erence Manual for 
details. 

15.3 Data Path Logic 

This section describes the data path logic. 

15.3.1 epiBus 

The epiBus may be used to load the I/O read buffer or the DMA write buffer. 
I n addition to write data, byte masl<s are stored in the DMA write buffer. 

The epiBus may also be used to unload the DMA read buffer (which also serves 
as the I/O write buffer). 

15.3.2 sysBus Output Selectors 

Two levels of muxes select the output for the sysData bus. The first level 
selects the source for each longword of data and checl< bits, and the second 
level selects the 2 longwords to be driven on the sysData bus. 

The source is described in Table 15-1. I n 64-bit memory mode, the lower and 
upper mux worl< together to select longwords and 2 in the first cycle (while 
the other 21071-BA data chip selects longwords 1 and 3) and to select 4 and 
6 in the second cycle (while the other 21071-BA data chip selects longwords 5 
and 7). 
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Table 15-1 sysBus Output Sources 



Buffer Function 



Memory read DMA and CPU read, DMA write masked 

Merge LDx_L, STx_C 

M erge and memory read CPU write allocates 

I/O read CPU I/O space reads 

The lower 16 bits of the sysData bus are controlled by a special signal to enable 
the 21071-CA chip to drive the lower 16 bits on CSR reads from the 21071-CA 
chip while the 21071-BA data chips drive the remaining data lines. 
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16 

DECchip 21071-BA Transactions and 

Timing Diagrams 



This chapter describes the flow of data within the 21071-BA chip on various 
transactions on the sysBus, memory data bus, and epiBus. 

16.1 sysBus Transactions 

This section describes the sysBus transactions. 

16.1.1 CPU Memory Read 

Read data from memory is loaded into the memory read buffer by the memory 
control machine in the 21071-CA. This data is available, by default, when the 
sysBus controller enables the 21071-BA chips to drive the sysData bus. 

The sysBus controller sends sysReadOW to indicate when the 21071-BA chips 
must switch to the high-order octaword. 

16.1.2 CPU Memory Read with Victim 

The victim data is loaded from the sysBus into the memory write buffer 
through a holding latch. If the write buffer is full, the data is held in the 
holding latch until there is room for it in the write buffer. (The control for this 
is provided by the 21071-CA chip.) 

Read data from memory can be loaded into the memory read buffer 
independent of the loading of the memory write buffer. 

16.1.3 CPU Memory Write Allocate 

The CPU write data is loaded by the 21071-CA chip into the merge buffer 
through the holding pad latch. The merge buffer can never be full, so this 
loading does not stall. If the write is partial, read data from memory is loaded 
into the memory read buffer. If there is a victim, the victim data is written 
into the memory write buffer through the holding pad latch. 
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When all the data is in place (memory read data, CPU write data, and victim 
data), the appropriate longwords of the memory read buffer and the merge 
buffer are merged and sent out on sysData. 

16.1.4 CPU Memory Write Noncacheable/Noallocate 

The data from the sysBus is loaded into the memory write buffer through the 
holding latch. If the memory write buffer is full, the data has to stall. 

Data from the memory write buffer is unloaded by the memory control 
sequencer from the 21071-CA chip when it is ready to service the write. 

16.1.5 STx_C Hit 

The write data from the CPU is loaded into the merge buffer. If the address 
is a hit in the cache, the remaining data is read from the cache and is loaded 
into the unwritten longwords of the merge buffer. Data from the merge buffer 
is then sent out on the sysBus. 

16.1.6 STx_C Miss 

This is exactly like a CPU memory write. 

16.1.7 LDx_LHit 

Data is read from the cache and is loaded into the merge buffer. It is sent out 
on the sysBus from there. 

16.1.8 LDx_LMiss 

This is exactly like a CPU memory read. 

16.1.9 CPU Read From or Through the DECchip 21 071 -DA 

The 21071-DA chip sets the direction of the epIData bus to be from the 21071- 
DA chip to the 21071-BA chips. It sets the epIBus controls to indicate the I/O 
read buffer as the destination of the data. When the I/O read data is available, 
it is loaded into the I/O read buffer. The I/O read buffer has already been 
selected as the source of sysBus data by 21071-CA. The I/O read data is thus 
returned to the CPU. 

16.1.10 CPU Write To or Through the DECchip 21071-DA 

When an I/O write transaction is detected on the sysBus, the 21071-DA 
chip is required to set up the controls for the I/O write buffer to point to the 
appropriate entry of the I/O write buffer. The loading of data is controlled by 
the 21071-CA chip. 
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The 21071-DA chip sets the direction of the epi Data to point from the 21071-BA 
chip to the 21071-DA chip, and it extracts the data as needed by controlling 
the longword select bits and enabling the appropriate 21071-BA chips using 
epiEnable<3:0>. 

16.2 PCI and Other I/O Bus Transactions 

This section describes PCI and other I/O bus transactions. 

16.2.1 PCi Read from System Memory 

The 21071-DA chip performs a DMA read transaction on the sysBus and sets 
up the controls of the DMA read buffer to point to the appropriate entry of the 
DMA read buffer. 

The 21071-CA chip gets the data from memory or Bcache. If the data is to be 
read from memory, the memory read buffer is loaded as data is received from 
memory. Data from the memory read buffer is loaded into the DMA read buffer 
after error checking has happened. If the data will be read from the Bcache, 
the data is loaded from thesysBus via the holding pad latch into the DMA 
read buffer, after error checking has happened. 

The 21071-DA chip sets up the direction of the epi Data bus to be from the 
21071-BA chips to the 21071-DA chip whenever it is ready to receive data. As 
the data is loaded into the DMA read buffer, it is extracted by the 21071-DA 
chip. 

16.2.2 PCI Write to System Memory 

The direction of the epIData bus is set to be from the 21071-DA chip to the 
21071-BA chip by the 21071-DA chip. The appropriate controls for loading the 
correct write buffer entry are set. The write data and the corresponding byte 
masks are loaded into the selected entry as it is available. If for some reason 
the write is not valid, the 21071-DA chip can overwrite that entry by using 
the epILinel nval signal. epILinel nval should be used at the start of any DMA 
write that does not use the full cache line. 

Whenever the 21071-DA chip is ready to do the transaction on the sysBus, a 
DMA write is initiated. If the DMA write buffer contains completely unmasked 
data, the data from the DMA write buffer is moved to the memory write buffer 
after the proper error bits have been generated. 
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If the DMA write is partially masked, a read-modify-write is performed. Data 
is read from memory (cache miss) into the memory read buffer or from the 
sysBus (cache hit) and is merged with the data from the DMA write data 
based on the DMA write byte masl<s. Error checl<ing is performed on the read 
data. If there is no error or a correctable error (error is corrected in this case), 
checl< bits are generated for the merged data and are written to the memory 
write buffer. If there is an uncorrectable error in the read data, the merge is 
performed but incorrect checl< bits are written into the memory write buffer. A 
read from this location will result in a hard error later. 

16.3 epiBus Transactions 

This section describes the epiBus transactions. 

16.3.1 DMA Read Buffer to the 21 071 -DA 

The following table describes the cycles for an epiBus transaction which 
transfers data from the DMA read buffer to the 21071-DA, as shown in 
Figure 16-1. 

Cycle Description 

The 21071-DA chip may read data from the DMA read buffer after the 
data has been loaded by the 21071-CA chip. The earliest that data may be 
read out is two cycles after an ioCAcl«2:0>for that read or one cycle after 
sysReadOW for the octaword to be read. ioCAcl«2:0> in this cycle of the 
diagram indicates that data is ready by cycle 2. 

1 If ioCAck<2:0> was not sent in cycle 0, a sysReadOW Indicates that the first 
octaword of data may be read out in cycle 2. 

The 21071-DA chip recognizes that the data is going to be ready. It asserts 
the epiLineSel<l:0> lines to request a read of the DMA read buffer line 
which was indicated on ioLineSel when the read was started. The 21071-DA 
chip places a request for the first longword of read data by deasserting 
epiFromlOB (indicating a read), deasserting epiOWSel (indicating the first 
octaword), and asserting epiEnable<0>(LW within first octaword). If the 
21071-DA was driving epIData, it must tristate the bus by clk2F. 

2 The 21071-BA chip receives the epiBus control signals and begins driving 
epIData with the first longword of data. The 21071-BA also drives error 
information on epiBENErr<3:0>. See Table 8-7. 

The 21071-DA chip requests LW 1 by changing to assert epiEnable<l>. 
(Shown in figure as a 2, because epiEnable<3:0> = 0010 =2.) 

The 21071-DA chip receives and latches epiData<31:0>on clk2F. The 
21071-BA receives epiEnable<3:0> and tristates epiData<31:0> and 
epiBENErr<3:0>on clk2F. 
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Cycle Description 



3 Similar to cycle 2, the 21071-DA chip requests LW 2, and another 21071-BA 
chip drives LW 1. EpiData<31:0>and epiBEnErr<3:0>are always one cycle 
behind the EPI control lines. 

4 The 21071-DA chip requests LW 3; a 21071-BA chip drives LW 2. 

5 The 21071-DA chip requests LW 4; a 21071-BA chip drives LW 3. Because 
LW 4 is in the second octaword, epiOWSel asserts and epiEnable<0>is used. 

6 The read continues. There is no constraint on the order or number of times 
that a longword may be read out (as long as the LW is ready, as described in 
cycle 0). 
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Figure 16-1 Timing of DMA Read Buffer to the 21071-DA Transfer 
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16.3.2 I/O Write Buffer to 21 071 -DA 

An epiBus transaction that transfers data from the I/O write buffer to the 
21071-DA chip is identical to the previous case shown in Figure 16-1. Because 
the same buffer is used for both DMA reads and I/O writes, the only difference 
is that a different buffer line will be requested using epiLineSel<l:0>. (The 
line that was present on ioLineSel<l:0>when the I/O write occurred.) 

16.3.3 21071-DA to DMA Write Buffer 

The following table describes the cycles for an epiBus transaction that 
transfers data from the 21071-DA chip into the DMA write buffer, as shown in 
Figure 16-2. 

Cycle Description 

The 21071-DA chip places a request to store the first longword of DMA 
write data by asserting epiFromI OB (indicating a write into the 21071-BA), 
asserting epiSelDMA (indicating a DMA transfer), deasserting epiOWSel 
(indicating the first octaword), and asserting epiEnable<0> (LW within 
first octaword). The 21071-DA chip asserts the epiLineSel<l:0> lines to 
point to an empty line in the DMA read buffer. Because this is the first store 
to this DMA write buffer line, epiLlnelnval is asserted to clear all of the byte 
enables left over from the previous usage of the cache line. 

If a 21071-BA chip was driving epiData<31:0>, then it will tristate the bus 
byclk2F. 

1 The 21071-DA chip sends the data to be stored on epiData<31:0>. The 
21071-DA chip drives epiBENErr<3:0> with the byte enables for the 4 bytes 
in the longword. (epiBEnErr<3:0>is on if the byte is valid.) 

The 21071-BA chip receives the epiBus control signals and latches LW into 
the I/O read buffer. 

The 21071-DA chip requests that LW 1 be stored in the next cycle by 
changing to assert epiEnable<l>. 

2 Similar to cycle 1, the 21071-DA chip requests storing LW 2 and drives 
data for LW 1, and the 21071-BA chip latches LW 1. epiData<31:0>and 
epiBEnErr<3:0>are always one cycle behind the epiBus control lines. 

3 The 21071-DA chip requests storing LW 3; 21071-DA drives LW 2. 

4 The 21071-DA chip requests storing LW 4; 21071-DA drives LW 3. Because 
LW 4 is in the second octaword, epiOWSel asserts and epiEnable<0>is used. 

5 The 21071-DA chip requests storing LW 5; the 21071-DA drives LW 4. 
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Cycle Description 



The 21071-DA chip requests storing LW 6; the 21071-DA drives LW 5. 

If the 21071-DA can ensure that the last data will be sent by cycle 7, then it 
may request a DMA write transaction with the 21071-CA. By the time the 
21071-CA requires the DMA write data, it will have been loaded into the 
DMA write buffer. 

The stores continue. There is no constraint on the order or number of times 
that a longword may be stored. There is also no constraint that the entire 
cache line be loaded, because the epiLinelnval will set all of the byte enables 
that were not loaded to off. (This functionality allows an 21071-DA chip 
aggregate writes.) 
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Figure 16-2 Timing of 21071-DA to DMA Write Buffer Transfer 
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16.3.4 21 071 -DA to I/O Read Buffer 

The following table describes the cycles for an epiBus transaction that transfers 
data from the 21071-DA chip into the CPU I/O read buffer, as shown in 
Figure 16-3. 

Cycle Description 

It is presumed that a CPU read to I/O space has already begun, and that 
the 21071-DA chip recognizes that the read data is going to be ready. The 
21071-DA chip places a request to store the first longword of read data by 
asserting epiFromI OB (indicating an write into the 21071-BA), deasserting 
epiSelDMA (indicating an I/O transfer), deasserting epiOWSel (indicating 
the first octaword), and asserting epiEnable<0>(LW within first octaword). 
If the 21071-BA chip was driving epiData<31:0>, then it will tristate the bus 
by clk2F. The 21071-CA chip asserts syslORead to select the I/O read buffer 
onto the sysData bus. 

1 The 21071-DA chip sends the data to be stored on epiData<31:0>. The 
21071-DA chip drives epi BENE rr<3:0> with arbitrary values to prevent the 
bus from floating. 

The 21071-BA chip receives the epiBus control signals and latches LW into 
the I/O read buffer. 

The 21071-DA chip requests that LW 1 be stored in the next cycle by 
changing to assert epiEnable<l>. 

2 Similar to cycle 1, the 21071-DA chip requests storing LW 2, drives 
data for LW 1, and the 21071-BA chip latches LW 1. epiData<31:0>and 
epiBEnErr<3:0>are always one cycle behind the epiBus control lines. 

3 The 21071-DA chip requests storing LW 3; 21071-DA drives LW 2. 

4 The 21071-DA chip requests storing LW 4; 21071-DA drives LW 3. Because 
LW 4 is in the second octaword, epiOWSel asserts and epiEnable<0>is used. 

Because a full octaword of data will be stored in the 21071-BA chip 
by the end of cycle 4, the 21071-DA chip may send a cpuDRAcl«2:0> 
OK_NCACHE_NCHK request through the 21071-BA chip to the CPU. 

5 The 21071-DA chip requests storing LW 5; 21071-DA drives LW 4. 

The 21071-CA chip receives the cpuDRAck<2:0>OK_NCACHE_NCHK 
request and sends the OK on cpuDRAcl«2:0>. The 21071-BA chip sends the 
first octaword on sysData<31:0>totheCPU. 

6 The stores continue. There is no constraint on the order or number of times 
that a longword may be stored. There is also no constraint that an entire 
octaword be sent on epiData<31:0> before requesting a cpuDRAck<2:0>. (Of 
course, a cpuDRAck<2:0> cannot be requested before all of the data that 
needs to be sent has been transferred.) 
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Figure 16-3 Timing of 21071-DA to I/O Read Buffer Transfer 
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17 

DECchip 21071-BA Electrical Data 



This chapter includes the following information about the DECchip 21071-BA 
chip: 

• DC Electrical Data 

• AC Electrical Data 

17.1 DC Electrical Data 

This section describes the dc characteristics of the DECchip 21071-BA chip. 

17.1.1 Absolute Maximum Ratings 

Table 17-1 lists the maximum ratings of the DECchip 21071-BA chip. 
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Table 17-1 DECchip 21071-BA Maximum Ratings 



Characteristics IVIinimum Maximum 



Storage temperature 
Operating ambient temperature 

Air flow 

J unction temperature 



Supply voltage with respect to Vss, 
witii reset_l asserted 

Supply voltage witii respect to Vss, 
witii reset I deasserted 



Voltage on any pin witii respect to Vss 



Maximum power: 

(a/dd= 5.25 V 1.7 W 

(aCycle =30 ns 

^Ifpm = linear feet per minute 



-55°C (-67°F) 


125°C (257°F 


0°C (32°F) 


40°C (104°F 


100 Ifpm^ 


— 


25°C (77°F) 


100°C (212°F 


-0.5 V 


-f6.5 V 


4.75 V 


5.25 V 


-0.5 V 


Vdd +0.5V 
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Table 17-2 lists the dc parametric values of the DECchip 21071-BA chip. 
Table 17-2 DC Parametric Values 



Symbol 


Description 


lUlinimum 


lUlaximum 


Units 


Test Conditions 


^ih 


Input high voltage 


2.0 


— 


V 


— 


^il 


Input low voltage 


— 


0.8 


V 


— 


^oh 


Output high voltage 


2.4 


— 


V 


— 


^ol 


Output low voltage 


— 


0.4 


V 


— 


'il 


1 nput leakage current-^ 


-5 


5 


UA 


V <Vin <Vdd 


'ilpu 


1 nput leakage current^ 


-15 


-100 


iiA 


V <Vin <Vdd 


'ilpd 


1 nput leakage current^ 


15 


100 


HA 


V <Vin <Vdd 


'ol 


Output leakage current 
(tristated) 


-10 


10 


MA 


V <Vin <Vdd 



^Excluding drvSysCSR, eccMode, testMode, tristateL, and wideMem. 

^For tristateL. 

^For drvSysCSR, eccMode, testMode, and wideMem. 



17.2 AC Electrical Data 

This section describes the ac characteristics of the DECchip 21071-BA chip. 

17.2.1 Clocks 

The DECchip 21071-BA uses one clock (running at twice the nominal system 
frequency) plus a synchronous phase reference signal to generate five internal 
clock edges. See Figure 17-1, Figure 17-2, Table 17-3, and Table 17-4 for 
details about DECchip 21071-BA external clock requirements and internal 
clock phase relationships. 

A clock system must meet the requirements in Figure 17-1 and Table 17-4 
to guarantee the proper behavior of the 21071-BA chip's internal logic. The 
21071-BA chip does not specify the maximum skew allowed for external 
transfers to or from the CPU, Bcache PALs, Bcache, 21071-CA chip, or 
21071-DA chip because these skew limits are dependent on module placement 
and routing. A system designer must examine external transfers to determine 
the maximum clock skews allowed between chips. 
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The skew numbers shown in Figure 17-1 and Table 17-4 are given for a 30.0 
ns cycle time. At a longer cycle time, the allowable skew may be increased, 
as long as the given minimum times between clock edges are not violated. 
These skew limits assume that the 21071-BA chip adds another 0.1 ns of 
uncertainty between rising and falling edges due to non-ideal input buffer 
switching thresholds. 



Table 17-3 DECchip 21071-BA Clock AC Characteristics 

Parameter Minimum Maximum Unit Note 

System cycle time 
clklx2 period 
clklx2 frequency 
clklx2 rise time 
clklx2 fall time 



clk2ref setup to clklx2 rising 
clk2ref hold from clklx2 rising 



30 


— 


ns 


cin Figure 17-1 


15 


— 


ns 


— 


— 


66 


MHz 


— 


— 


1 


ns 


— 


— 


1 


ns 


— 


0.8 





ns 


Tsu in Figure 17-1 


1.8 


— 


ns 


Th in Figure 17-1 
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Figure 17-1 DECchip 21071-BA Clock Skew Requirements 



sysClkOutI 
clk1 



Internal edges: 
Internal memClk: 



clk1R 



clk2R 
memClkR 



■.5*c - 0.50 ns min - 
■.5*c + 0.50 ns max - 



clklF 



clk2F 
memClkR 



clk1R 
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Table 17-4 DECchip 21071-BA Clock Skew Limits at clk1x2 Pin 



Parameter 



Example Transfers 



Maximum Unit Note 



clklx2 rising edge to rising edge 

cll<lx2 falling edge to falling 
edge 

clklx2 rising edge to falling edge 
clklx2 falling edge to rising edge 



clklR toclklR, clklR toclklF, 0.50 
clklF toclklR, clklF toclklF 

clk2R toclk2R, clk2R toclk2F, 1.25 
clk2F toclk2R, clk2F toclk2F 

ClklR to clk2R, ClklR to clk2F, 1.60 
ClklF toclk2R, clklF toclk2F 

clk2R toclklR, clk2R toclklF, 1.60 
clk2F toclklR, clk2F toclklF 



ns 



ns 



ns 



ns 



@ Cycle : 
30 ns 

@ Cycle : 
30 ns 

@ Cycle: 
30 ns 

@ Cycle ■ 
30 ns 
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Figure 17-2 DECchip 21071-BA Clock Signals 
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The 21071-BA imposes no requirements on clkl or sysClkOutl. Skew on 
clkl will be constrained by limits imposed by external paths to or from the 
Bcache control PALs. The phase error between sysClkOutl and clklx2 will be 
constrained by limits imposed by external paths to or from the CPU chip. 

17.2.2 Signals 

Figure 17-3 and Figure 17-4 demonstrate the timing measurements specified 
in Table 17-6 and Table 17-7. 
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Figure 17-3 DECchip 21071-BA Output Delay Measurement 
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Figure 17-4 DECcliip 21071-BA Setup and IHolcl Time lUleasurement 
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The following ac electrical data is specified with respect to the appropriate edge 
at the clklx2 pin. Both the output delay table and the setup/hold timetable 
assume a 1 ns edge rate at the clklx2 pin. 

All outputs drive a 50 pF load. When estimating module delays, you may 
need to replace the 50 pF load delay with a simulated (or calculated) delay. 
The delays for 4 mA and 8 mA drivers driving a 50 pF load are provided in 
Table 17-5. See Table 14-1 for information about the buffer size of every 
output pin. 



Table 17-5 DECchip 21 071 -B A Output Buffer Delays into a 50 pF Load 



Type 



Minimum 



iUlaximum 



Unit 



4 mA 
8 mA 



3.5 
2.3 



7.6 
5.0 



ns 
ns 



Table 17-6 DECchip 21071-BA AC Characteristics (Valid Delay into a 50 pF 
Load) 



Signal 



Minimum Maximum Unit Reference Edge 



sysData<63:0>, 

sysPar<l:0>, 

sysChecl«5:0> 



5.9 



18.5 



ns 



clklR 



memData<31:0>, 

memPar<0>, 

memCheck<6:0> 



4.3 



16.8 



ns 



memClkR 



epiData<31:0>, 
epiBEnErr<3:0> 



4.9 



16.2 



ns 



clklR 
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Table 17-7 DECchip 21071-BA AC Characteristics (Setup/Hold Time) 



Signal 



Setup Hold Unit Reference Edge 



sysData<63:0>, 

sysPar<l:0>, 

sysCha:k<6:0> 



2.4 2.9 



ns 



clk2F 



memData<31:0>, 

memPar<0>, 

memChecl«6:0> 



0.6 4.4 



ns 



memClkR 



epiData<31:0>, 
epiBEnErr<3:0> 



1.0 5.2 ns 



clk2F 



sysCmd<2:0>, 
subCmd<l:0>, 
syslORead, 
sysReadOW 



0.6 5.0 ns clk2F 



drvSysData^, 
drvSysCSR 

drvSysData^ 



2.1 2.6 ns clklR 

2.1 2.6 ns clklF 



memCmd<3:l> 



2.1 5.2 ns 



clklR 



epiFromlOB, 
epiSelDMA 



1.0 5.2 



ns 



clk2F 



ioLlneSei, 
epILInelnva! 



1.0 5.2 



ns 



clk2F 



epiOWSeP, 
epiLineSel<l:0>^ 



1.0 5.2 



ns 



clk2F 



■"^ For dvrSysData asserting. 

^ For dvrSysData deasserting. 

2 These signals pass tlirougli a transparent latcli (closing on dl<2F) and then reach a cll<lR flip-flop. 
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Table 17-7 (Cont.) DECchip 21071-BA AC Characteristics (Setup/Hold Time) 

Signal Setup hHold Unit Reference Edge 

epiOWSeP, 9.7 — ns clklR 
epi Lines el <1:0>^ 

epiEnable<l:0>^ 1.0 5.2 ns clk2F 

epiEnable<l>^ 6.0 — ns clklR 

^ These signals pass through a transparent latch (closing on cll<2F) and then reach a cll<lR flip-flop. 
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18 



DECchip 21071-BA Power-Up and 

Initialization 



This chapter describes the behavior of the 21071-BA chip on power-up and 
assertion of reset_l. 

18.1 Power-Up 

On power-up, the reset_l input of the 21071-BA chip should be asserted. It 
should be l<ept asserted until the system clocl<s are up and running for 20 
cycles. 

18.2 Internal Reset 

The assertion and deassertion of the reset_l pin on the module is asynchronous 
to the DECchip 21071-BA. An internal reset signal is generated from reset_l 
which asserts asynchronously as soon as reset_l is asserted, but which 
deasserts synchronously. Due to the synchronous deassertion of the internal 
reset, the DECchip 21071-BA requires that no external transaction should 
start until 10 system clocl< cycles after the deassertion of reset_l. 

18.3 State of Pins on Reset Assertion 

The following are general rules and requirements for the behavior of the 
21071-BA at its pins during reset: 

• All input only control signals (except the clocks and resetj) should be in 
the deasserted state as long as reset is asserted. 

• All output only signals are deasserted. 

• All bidirectional signals are tristated. 

• wideMem and eccMode should be stable before reset_l deasserts and should 
never change thereafter. 
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The exceptions to these rules are as follows: 

• The value of memData<31:0> is unpredictable; the drive state depends on 
the state of the drvMemData input. 

• drvMemData is asserted by the 21071-CA during reset so memData<31:0> 
are driven by the 21071-BA. 

Note 



I n all cases, the assertion of tristate_l overrides the assertion of reset_l. 
That is, if tristate_l is asserted during reset, all the outputs of the 
chip go to their High-Z state. If resetj is still asserted when tristate_l 
deasserts, the signals return to the normal reset state described 
previously 
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Bcache PAL Equations 



This appendix provides Bcache PAL equations. 
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Table A-1 provides cache data write enable equations. 



Table A-1 Equations for Cache Data Write Enables 



bcDataWEB^ = 

bcDataWEB.OE 
bcDataWE2^ = 

bcDataWE2.0E 
bcDataWEl^ = 

bcDataWEl.OE 
bcDataWEO^ = 

bcDataWEO.OE 
!bcDataA4 3^ 2= 



bcDataA4_3.0E 
!bcDataA4 2^ ^ = 



bcDataA4_2.0E 
!bcDataA4 1^ ^ = 



bcDataA4 l.OE = 



(sysDataLongWE 
#(sysDataWEEn 
#cpuDataWE3; 

(( 1 »; 

(sysDataLongWE 
#(sysDataWEEn 
#cpuDataWE2; 

(( 1 »; 

(sysDataLongWE 
#(sysDataWEEn 
#cpuDataWEl; 

(( 1 »; 

(sysDataLongWE 
#(sysDataWEEn 
#cpuDataWEO; 

(( 1 )); 

( sysDataALEn & 
#( sysDataAHEn 
#( sysDataALEn 
#cpuDataA4; 

(( 1 »; 



( sysDataALEn & !clk2 ) 

#( sysDataAHEn & clk2 ) 

#( sysDataALEn & sysDataAHEn^ 

#cpuDataA4; 

= ((!)); 



& longWr) 

& !clkl & llongWr) 



& longWr) 

& !clkl & llongWr) 



& longWr) 

& Icikl & llongWr) 



& longWr) 

& Iclkl & llongWr) 



Iclk2 ) 

& clk2 ) 

& sysDataAHEn^ 



( sysDataALEn & 
#( sysDataAHEn 
#( sysDataALEn 
#cpuDataA4; 

(( 1 )); 



Iclk2 ) 

& clk2 ) 

& sysDataAHEn^ 



i# = OR, & =AND, ! =NOT 

^Cache address bit 4; these are 4 identical copies. 

•'This term is logically redundant but must be included to prevent glitches in the output. 
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Table A-1 (Cont.) Equations for Cache Data Write Enables 



!bcDataA4_0i ^ = ( sysDataALEn & !clk2 ) 

#(sysDataAHEn & clk2 ) 
#( sysDataALEn & sysDataAHEn^ ) 
#cpuDataA4; 

bcDataA4_0.OE = ((1 )); 

i# = OR, & =AND, ! =NOT 

^Cache address bit 4; these are 4 identical copies. 

^Tliis term is logically redundant but must be included to prevent glitches in the output. 
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Table A-2 provides tag and data output enable equations. 



Table A-2 Equations for the Tag and Data Output Enables 



bcTagCEOE^ 



bcTagCEOE.OE 
bcDataCEOEO^ : 



bcDataCEOEO.OE 
bcDataCEOEl^ = 



bcDataCEOEl.OE 
bcDataCEOEZ^ = 



bcDataCE0E2.0E 
bcDataCE0E3^ = 



bcDataCEOEB.OE 
cpuD0Ei2= 

cpuDOE.OE^^ = 



(sysTagOEEn & IsenseDls 
#cpuTagCEOE & IsenseDls 
#sysEarlyOEEn & cpuCReq2 & IsenseDls 
#sysEarlyOEEn & cpuCReql & IsenseDls 
#sysEarlyOEEn & cpuCReqO & IsenseDls); 

(( 1 »; 

(sysDataOEEn & IsenseDls 

#cpuDataCEOE & IsenseDls 

#sysEarlyOEEn & lcpuCReq2 & cpuCReqO & IsenseDls 

#sysEarlyOEEn & cpuCReq2 & IcpuCReqO & IsenseDls 

#sysEarlyOEEn & lcpuCReq2 & cpuCReql & IsenseDls) 

(( 1 )); 

(sysDataOEEn & IsenseDls 

#cpuDataCEOE & IsenseDls 

#sysEarlyOEEn & lcpuCReq2 & cpuCReqO & IsenseDls 

#sysEarlyOEEn & cpuCReq2 & IcpuCReqO & IsenseDls 

#sysEarlyOEEn & lcpuCReq2 & cpuCReql & IsenseDls) 

(( 1 )); 

(sysDataOEEn & IsenseDls 

#cpuDataCEOE & IsenseDls 

#sysEarlyOEEn & lcpuCReq2 & cpuCReqO & IsenseDls 

#sysEarlyOEEn & cpuCReq2 & IcpuCReqO & IsenseDls 

#sysEarlyOEEn & lcpuCReq2 & cpuCReql & IsenseDls) 

(( 1 )); 

(sysDataOEEn & IsenseDls 

#cpuDataCEOE & IsenseDls 

#sysEarlyOEEn & lcpuCReq2 & cpuCReqO & IsenseDls 

#sysEarlyOEEn & cpuCReq2 & IcpuCReqO & IsenseDls 

#sysEarlyOEEn & lcpuCReq2 & cpuCReql & IsenseDls;! 

(( 1 )); 

(sysEarlyOEEn & cpuCReq2 & cpuCReqO 
#sysDOE ); 

IsenseDls; 



i# = OR, & =AND, I =NOT 

^CPU output enable; must betristated when 3.3 V is not stable. 
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Note 

In addition to the two PALs, theDECchip 21071 and DECchip 21072 
chipsets also require two NOR gates to control the cache. These may 
be implemented using NORs or unused portions of the PALs. 



Table A-3 provides Bcache and NOR gate equations. 



Table A-3 


Equations for Bcache and NOR Gates 




tagCtlWEJ 
tagAdrWE_ 


1 _ 
1' = 




! ( cpuTagCtlWe 
! ( sysTagWE ) 


+ sysTagWE ) 


i# = OR, & = 


= AND, ! 


= NOT 
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B 



Technical Support and Ordering 

Information 



B.1 Technical Support 

If you need technical support or help deciding which literature best 
meets your needs, call the Digital Semiconductor Information Line: 

United States and Canada 1-800-332-2717 
Outside North America +1-508-628-4760 

B.2 Ordering Digital Semiconductor Products 

To order the DECchip 21071 and DECchip 21072 core logic chipsets, contact 
your local distributor. 

B.3 Ordering Associated Literature 

For a complete list of available Digital Semiconductor literature contact the 
Digital Seminconductor Information Line. 
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